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@ Method for the expression of genes In plants. 

@ A method for the expression of genes in plants, parts of 
plants, and plant cell cultures, in which a DNA fragment Is 
used comprising an inducible plant promoter of root nodule- 
specific genes, DNA-fragments comprising an inducible 
plant promoter, to be used when carrying out the method, 
said DNA-fragments being identical with, derived from or 
comprising a 5* flanking region of root nodule : specific 
genes of any origin as well as plasm ids and transformed 
Agrobacterium rhlzogenes-strain which can be used when 
^ carrying out the method. 
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A method for the ex pression of genes in plants, 
parts of plants, and plant cell cultures, and DNA 
fragments, plasmids. and transformed microorganisms 
to be used when carrying out the method, as well 
5 as the use thereof for the expressi on of genes in 
plants, parts of pl ants, and plant cell cultures, 

The invention relates to a novel method for the 
expression of genes in plants, parts of plants, 
and plant cell cultures, as well as DNA fragments 
lO and plasmids comprising said DNA fragments to be 
used when carrying out the method. The invention 
furthermore relates to transformed plants, parts 
of plants and plant cells. 

The invention relates to this method for the ex- 
15 pression of genes of any origin under control of 
an inducible, root nodule specific promoter. 



The invention relates 
for the expression of 
in transformed plants 
20 plants and other plants 



especially to this method 
root nodule - specif ic genes 
including both leguminous 



The invention relates furthermore to DNA fragments 
comprising an inducible plant promoter to be used 
when carrying out the method, as well as plasmids 
comprising said DNA fragments. 

25 In the specification i.a. the following terms are 
used : 

Root nodule-specific genes: Plant genes active 
only in the root nodules of leguminous plants, or 
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genes with an increased expression in root nodules. 
Root nodule-specific plant genes are expressed at 
predetermined stages of development and are ac- 
tivated in a coordinated manner during the symbiosis 
5 whereby a nitrogen fixation takes place and the 
fixed nitrogen is utilized in the metabolism of 
the plant. 

Inducible plant promoter: Generally is meant a 
promoter- active 5' flanking region from plant genes 

lO inducible from a low activity to a high activity. 
In relation to the present invention "inducible 
plant promoter" means a promoter, derived from, 
contained in or being identical with a 5' flanking 
region including a leader sequence of root nodule- 

15 specif ic genes and being capable of promoting and 
regulating the expression of a gene as characterised 
in relation to the present invention. 

Leader sequence: Generally is meant a DNA sequence 
being transcribed into a mRNA, but not further 

20 translated into protein. The leader sequence com- 
prises thus the DNA fragment from the start of the 
transcription to the ATG codon constituting the 
start of the translation. In relation to the present 
Invention "leader sequence" means a short DNA frag- 

25 ment contained in the above inducible plant promoter 
and typically comprising 40-70 bp and which may 
comprise sequences being targets for a posttran- 
scriptional regulation . 

Promoter region: A DNA fragment containing a pro- 
3Qmoter which comprises target sequences for RNA 
polymerase as well as possible activation regions 
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comprising target sequences for transcriptional 
effector substances. In the present invention, 
target sequences for transcriptional effectors may 
also be situated 3' to the promoter, i.e. in the 
5 coding sequences, the intervening sequences or on 
the 3' flanking region of a root nodule - specif ic 
gene . 

Furthermore a number of molecular-biological terms 
generally known to persons skilled in the art are 
lO used, including the terms stated below: 

CAP (addition) site: The nucleotide of the 5' end 
of the transcript where 7-methylGTP is added; In 
the Figures often given also as an asterisk *-marked 
nucleotide on a given nucleotide sequence. 

15 DNA sequence or DNA s egment: A linear array of 
nucleotides interconnected through phosphodies ter 
bonds between the 3 r and 5' carbon atoms of adjacent 
pentoses . 

Expression: The process undergone by a structural 
20 g ene to produce a polypeptide. It is a combination 
of transcription and translation as well as possible 
pos ttranslational modifications. 

Flanking regions : DNA sequences surrounding coding 
regions. 5' flanking regions contain a promoter. 
25 3' flanking regions may contain a transcriptional 
terminator etc. 



Gene : A DNA sequence composed of three or four 
parts, viz. (1) the coding sequence for the gene 
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product, (2) the sequences in the promoter region 
which control whether or not the gene will be ex- 
pressed, (3) those sequences in the 3' end con- 
ditioning the transcriptional termination and op- 
5 tionally polyadenylation , as well as (4) interven- 
ing sequences, if any. 

Intervening sequences : DNA sequences within a gene 
which are not coding for any peptide fragment. The 
intervening sequences are transcribed into pre-mRM 
lO and are eliminated by modification of pre-mRNA 
into mRNA. They are also called introns . 

Chime ric gene: A gene composed of parts from various 
genes. E.g. the chimeric Lbc3 - 5 ' - 3 ' - CAT is composed 
of a chloroamphenicolacetyltransf erase -coding se- 
15 quence deriving from E . coli and 5' and 3' flanking 
regulatory regions of the Lbc3 gene of soybean. 

Cloning: The process of obtaining a population of 
organisms or DNA sequences deriving from one such 
organism or sequence by asexual reproduction, or 
20more particular a process of isolating a particular 
organism or part thereof, and the propagation of 
this subfraction as a homogeneous population. 

Coding sequences : DNA sequences determining the 

amino acid sequence of a polypeptide. 

25 Cross - inoc ulation group : A group of leguminous 
plant species capable of producing functionally 
active root nodules with Rhizobium bacteria isolated 
from root nodules of other species of the group. 
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Leghemoglobin (Lb) : An oxygen-binding protein ex- 
clusively synthesized in root nodules. The Lb pro- 
teins regulate the oxygen partial pressure in the 
root nodule tissue and transport oxygen to the 
5 bacteroides. In this manner the oxygen- sens itive 
nitrogenase enzyme is protected. The Lb genes are 
root nodule-specific genes. 

Messenger-RNA (mRNA) : RITA molecule produced by tran- 
scription of a gene and possibly modification of 
10 mRNA. The mRNA molecule mediates the genetic message 
determining the amino acid sequence of a polypeptide 
by part of the mRNA molecule being translated into 
said peptide. 

Downstream : A position in a, DNA sequence. It is 
15 defined relative to the transcriptional direction 
5'- 3' of the gene relative to which the position 
is stated. The 3' flanking region is thus posi- 
tioned downstream of the gene. 

Nucleotide : A monomeric unit of DNA or RNA con- 
20 sisting of a sugar moiety (pentose), a phosphate, 
and a nitrogeneous heterocyclic base. The base is 
linked to the sugar moiety via a glycosidic bond 
(1' carbon of the pentose), and this combination 
of base and sugar is a nucleoside. The base cha- 
25 racterises the nucleotide. The four DNA bases are 
adenine (A), guanine (G) , cytosine (C), and thymine 
(T) . The four RNA bases are A, G, C, and uracil (U) . 

Up s tream : A position in a DNA sequence. It is de- 
fined relative to the transcriptional direction 
30 5 ' - 3 f of the gene relative to which the position 
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is stated. The 5' flanking region is thus positioned 
upstream of this gene. 

Plant transformation: Processes leading to incor- 
poration of genes in the genome of plant cells in 
5 such a manner that these genes are reliably in- 
herited through mitosis and meiosis or in such a 
manner that these genes are only maintained for 
short periods. 

Plasmid: An extra- chromosomal double - s tranded DNA 
lO sequence comprising an intact replicon such that 
the plasmid is replicated in a host cell. When the 
plasmid is placed within a unicellular organism, 
the characteristics of that organism are changed 
or transformed as a result of the DNA of the plas- 
15 mid. For instance a plasmid carrying the gene for 
tetracycline resistance (Tc R ) transforms a cell 
previously sensitive to tetracycline into one which 
is resistant to it. A cell transformed by a plasmid 
is called a transf ormant . 

20 Polypeptide : A linear array of amino acids inter- 
connected by means of peptide bonds between the 
<x-amino and carboxy groups of adjacent amino acids. 

Recombination : The creation of a new DNA molecule 
by combining DNA fragments of different origin. 

25 Homologous recombination: A recombination between 
sequences showing a high degree of homology. 



Replication : A process reproducing DNA molecules. 
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Replicon: A self - replicating genetic element pos- 
sessing an origin for the initiation of DNA re- 
plication and genes specifying the functions neces- 
sary for a control and a replication thereof. 

5 Restriction fragment: A DNA fragment resulting from 
double - stranded cleavage by an enzyme recogniz ing 
a specific target DNA sequence. 

RNA polymerase: Enzyme effecting the transcription 
of DNA into RNA. 

1Q Root nodule : Specialized tissue resulting from 
infection of mainly roots of leguminous plants 
with Rhizobium bacteria. The tissue is produced by 
the host plant and comprises therefore plant cells 
whereas the Rhizobium bacteria upon infection are 

15 surrounded by a plant cell membrane and differen- 
tiate into bacteroides. Root nodules are produced 
on other species of plants upon infection of nitro- 
gen-fixing bacteria not belonging to the Rhizobium 
genus. Root nodule - spec if ic plant genes are also 

20 expressed in these nodules. 

Southern-hvbridization: Denatured DNA is transferred 
upon size separation in agarose gel to a nitro- 
cellulose membrane. Transferred DNA is analysed 
for a predetermined DNA sequence or a predetermined 

25 gene by hybridization. This process allows a binding 
of single-stranded, radioac t ively marked DNA se- 
quences (probes) to complementary s ingle - s tranded 
DNA sequences bound on the membrane. The position 
of DNA fragments on th membrane binding the probe 

30 can subs quently be detected on an X-ray film. 
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Symbiotic nitrogen fixati on ; T h ft relationship where- 
by bacteroides of root nodules convert the nitrogen 
(dinitrogen) of the air into ammonium utilized by 
the plant while the plant provides the bacteroides 
5 with carbon compounds as a carbon source. 

Svmbiont : One part of a symbiotic relationship, and 
especially Rhizobium is called the microsymbiont . 

Transformation ; The process whereby a cell is incor- 
porating a DNA molecule. 

lO Translation; The process of producing a polypeptide 
from mRNA or: 

the process whereby the genetic information present 
in a mRNA molecule directs the order of specific 
amino acids during the synthesis of a polypeptide. 

15 Transcription:, The method of synthesizing a com- 
plementary RNA sequence from a DNA sequence. 

Vector: A plasmid, phage DNA or other DNA sequences 
capable of replication in a host cell and having 
one or a small number of endonuclease recognition 
2o sites at which such DNA sequences may be cleaved 
in a determinable manner without loss of an es- 
sential biological function. 

Traditional plant breeding is based on repeated 
crossbreeding of plant lines individually carrying 
25 desired qualities. The identification of progeny 
lines carrying all the desired qualities is a par- 
ticularly time-consuming process as the biochemical 
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and genetic basis of the qualities is usually un- 
known. New lines are therefore chosen according to 
their phenotype, usually after a screening of many 
lines in field experiments. 

5 Through the ages a direct connection has existed 
* between the state of nutrition, i.e. the health, 
of the population and the agricultural possibility 
of ensuring a sufficient supply of assimilable 
nitrogen in order to obtain satisfactory yields. 

10 Already in the seventeenth century it was discovered 
that plants of the family leguminosae including 
beyond peas also beans , lupins, soybean, bird's-foot 
trefoil, vetches, alfalfa, sainfoin, and trefoil had 
an ability of improving crops grown on the habitat 

15 of these plants. Today it is known that the latter 
is due to the fact that the members of the plants 
of the family leguminosae are able, to produce nitro- 
gen reserves themselves. On the roots they carry 
bacteria with which they live in symbiosis. 

20 An infection of the roots of these leguminous plants 
with Rhizobium bacteria causes a formation of root 
nodules able to convert atmospheric nitrogen into 
bound nitrogen, which is a process called nitrogen 
fixation . 

25 Atmospheric nitrogen is thereby converted into forms 
which can be utilized by the host plant as well as 
by the plants later on growing on the same habitat. 

In the nineteenth century the above possibility was 
utilized for the supply of nitrogen in order to 
30 achieve a novel increase of the crop yield. 
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The later further Increases in the yield have, how- 
ever, especially been obtained by means of natural 
fertilizers and nitrogen-containing synthetic fer- 
tilizers* The resulting pollution of the environ- 
5 merit makes it desirable to provide alternative 
possibilities of ensuring the supply of nitrogen 
necessary for the best possible yields obtainable. 

It would thus be valuable to make an improvement 
possible of the existing nitrogen fixation systems 
lO in leguminous plants as well as to allow an in- 
corporation of nitrogen fixation systems in other 
plants . 

The recombinant DNA technique and the plant trans- 
formation systems developed render it now possible 

15 to provide plants with new qualities in a well- 
controlled manner. These characteristics can derive 
from not only the same plant species, but also 
from all other prokaryotic or eukaryotic organisms. 
The DNA techniques allow further a quick and spe- 

20 cif ic identification of progeny lines carrying the 
desired qualities. In this manner a specific plant 
line can be provided with one or more desired qual- 
ities in a quick and well-defined manner. 

Correspondingly, plant cells can be provided with 
25 well defined qualities and subsequently be main- 
tained as plant cell lines by means of known tissue 
culture methods. Such plant cells can be utilized 
for the production of chemical and biological prod- 
ucts of particular interest such as dyes, flavours, 
30 aroma components, plant hormones, pharmaceutical 
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products, primary and secondary metabolites as 
well as polypeptides (enzymes) . 

A range of factors and functions necessary for 
biological production of a predetermined gene pro- 
5 duct are known. Both the initiation and regulation 
of transcription as well as the initiation and 
regulation of pos ttransscr ip tional processes can 
be characterised. 

At the gene level it is known that these functions 
10 are mainly carried out by 5' flanking regions. A 
wide range of 5' flanking regions from prokaryotic 
and eukaryotic genes has been sequenced, and in 
view inter alia thereof a comprehensive knowledge 
has been provided of the regulation of gene ex- 
15 pression and of the sub-regions and sequences being 
of importance for the regulation of expression of 
the gene. Great differences exist in the regulatory 
mechanism of prokaryotic and eukaryotic organisms , 
but many common features apply to the two groups. 

20 The regulation of the expression of gene may take 
place on the transscrip tional level and is then 
preferably exerted by regulating the initiation 
frequency of trans scrip tion . The latter is well- 
known and described inter alia by Benjamin Lewin, 

25 Gene Expression, John Wiley & Sons, vol. I, 1974, 
vol. II, Second Edition 1980, vol. Ill, 1977. As 
an alternative the regulation may be exerted at 
the pos ttransscr ip tional level, e.g. by the re- 
gulation of the frequency of the translation ini- 

30 tiation, at the rate of the translation, and of 
the termination of the translation. 
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The present invention is based on the surprising 
finding that 5' flanking regions of root nodule- 
specific genes, exemplified by the 5' flanking 
region of the soybean leghemoglob in Lbc3 gene, can 
5 be used for inducible expression of a foreign gene 
in an alien leguminous plant. The induction and 
regulation of the promoter is preferably carried 
out in the form of a regulation and induction at 
the transcriptional level and differs thereby 
10 from the inducability stated in Patent Application 
No. 86114704. 9 „ the latter inducability preferably 
being carried out at the translation level. 

The transscription of both the Lbc 3 gene of the 
soybean and of a chimeric Lbc3 gene transferred to 

15 bird's-foot trefoil starts at a low level immediate- 
ly upon the appearance of the root nodules on the 
plant roots. Subsequently, a high increase of the 
transscription takes place immediately before the 
root nodules turn red. The transcription of a range 

20 of other root nodule - specif ic genes is initiated 
exactly at this time. The simultaneous induction 
of the transscription of the Lb genes and other 
root nodule-specific genes means that a common DNA 
sequence(s) must be present for the various genes 

25 controlling this pattern of expression. Thus the 
leghemoglob in- C3 gene is a representative of one 
class of genes and the promoter and the leader 
sequence , target areas for activation as well as 
the control elements of the organ specificity of 

30 the Lbc 3 gene are representatives of the control 
elements of a complete gene class. 
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The promoter of the 5' flanking regions of the Lb 
genes functions in soybeans and is responsible for 
the transcription of the Lb genes in root nodules. 
It is furthermore known, that the efficiency of 
5 both the transcription initiation and the subsequent 
translation initation on the leader sequence of the 
Lb genes is high as the Lb proteins constitute ap- 
proximately 20% of the total protein content in 
root nodules . 

lO The sequence of 5' flanking regions of the four 
soybean leghemoglobin genes Lba, Lbc^, Lbc2» and 
Lbc3 appears from the enclosed sequence scheme, 
scheme 1, wherein the sequences are stated in such 
, a manner that the homology between the four 5' 

15 flanking regions appears clearly. 

In the sequence scheme n - n indicates that no base 
is present in the position in question. The names 
of the genes and the base position counted upstream 
from the ATG start codon are indicated to the right 
20 of the sequence scheme. Furthermore the important 
sequences have been underlined. 

As it appears from the sequence scheme a distinct 
degree of homology exists between the four 5' flan- 
king regions, and in the position 23-24 bp upstream 

25 from the CAP addition site they all contain a 
TATATAAA sequence corresponding to the "TATA" box 
which in eukaryotic cells usually are located a 
corresponding number of bp upstream from the CAP 
addition site. Furthermore a CCAAG sequence is 

30 present 64-72 bp upstream from the CAP addition 
sit , said s quence corresponding to the n CCAAT n 
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box usually located 70-90 bp upstream from the CAP 
addition site. From the CAP addition site to the 
translation start codon, ATG , leader sequences of 
52-59 bp are present and show a distinct degree of 
5 homology of approx. 75-80%. 

In accordance with the present invention it has 
furthermore been proved, exemplified by the Lbc3 
gene, that the 5' flanking regions of the soybean 
leghemoglobin genes are functionally active in 

10 other plant species. The latter has been proved by 
fusioning the E. coli chloramphenicol acetyl trans- 
ferase (CAT) gene with the 5' and 3' flanking re- 
gions of the soybean Lbc3 gene in such a manner 
that the expression of the CAT gene is controlled 

15 °y the Lb promoter. This fusion fragment was cloned 
into the integration vectors pARl and pAR22, where- 
by the plasmids pAR29 and pAR30 were produced. 
Through homologous recombination the latter plasmids 
were integrated into the Agrobacterium rhizogenes 

20 T DNA region. The transformation of Lotus cornicu- 
latus (bird's-foot trefoil) plants, i.e. transfer 
of the T DNA region, was obtained by wound infection 
on the hypokotyl. Roots developed from the trans- 
formed plant cells were cultivated in vitro and 

25 freed from A. rhizogenes bacteria by means of anti- 
biotics . Completely regenerated plants were produced 
by these root cultures in a conventional manner 
through somatic embryogenesis or organogenesis. 

Regenerated plants were subsequently inoculated 
30 with Rhizobium loti bacteria and root nodules for 
analysis were harvested. Transcription and trans- 
lation of the chimeric Lbc3 CAT gene could subse- 
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quently be detected in root nodules on transformed 
plants as the activity of the produced chloram- 
phenicol acetyl transferase enzyme. 

The conclusion can subsequently be made that the 
5 promoter-containing 5' flanking regions of root 
nodule-specific genes exemplified by the soybean 
Lbc3 promoter are functionally active in foreign 
plants. The latter is a surprising observation as 
root nodules are only developed as a consequence 
lO of a very specific interaction between the legu- 
minous plant and its corresponding Rhizobium micro- 
symbiont . 

Soybeans produce nodules only upon infection by the 
species Rhizobium j aponicum and Lotus corniculatus 

15 only upon infection by the species Rhizobium lot 1 . 
Soybean and Lotus corniculatus belong therefore to 
two different cross - inoculation groups, each group 
producing root nodules by means of two different 
Rhizobium species. The expression of a chimeric 

20 soybean gene in Lotus corniculatus proves therefore 
an unexpected universal regulatory system applying 
to the expression of root nodule - specif ic genes. 
The regulatory DNA sequences involved can be placed 
on the 5' and 3' flanking regions of the genes, 

25 here exemplified by the 2.0 Kb 5 ' and 0.9 Kb 3' 
flanking regions of the Lbc 3 gene. This surprising 
observation allows the use of root nodule - specif ic 
promoters and regulatory sequences in any other 
plant species and any other plant cell line. 

30 In other experiments the 5' flanking region of the 
nodule- specif ic N23 gene was fused to the CAT gene 
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and the Lbc 3 3' flanking region in such a manner 
that the expression of the CAT gene is controlled 
by the N23 promoter. This fusion fragment was cloned 
into the integration vector pAR22 producing the 
Splasmid N23-CAT which was subsequently recombined 
into A.rhizogenes and transferred to Lotus corn! - 
cujLafrus an d Trifolium renens (white clover) by the 
previously described method. The root nodule-spe- 
cific expression of the transferred N23 - CAT gene 

10 obtained in L. corn iculatus infected with Rhizobium 
lot! and in T.repens infected with Rhizobium tri- 
f oli?L further demonstrated that expression of root 
nodule-specific genes is independent of the plant 
species and Rhi^objum species. A universal regu- 

15 latory system therefore regulates the expression 
of root nodule -specific genes in the different 
symbiotic systems formed between legumes and the 
Rhizobium species of the various cross - inoculation 
groups . 

20lt is known from European Patent Application EP 
122, 791. Al that plant genes from one species, by 
Agrobacterium mediated transformation, can be trans- 
ferred into a different plant species. It is also 
known from EP 122, 791. Al that a transferred gene 

25encoding the seed storage protein "Phaseolin" can 
be expressed into tobacco and alfalfa. From the 
literature it is also known that this expression 
is seed specific (Sengupta-Gopalan et al. 1985, 
Proc. Natl. Acad. Sci. 82, 33203324). 

30The present invention therefore relates to a novel 
method for the expression of transferred genes in 
a roo t nodul e- specif ic manner . using DNA regulatory. 
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sequences from the 5' promoter region, the coding 
region, or the 3' flanking region of root nodule- 
specific genes, here exemplified by the leghemoglo- 
bin Lbc 3 gene and the N23 gene. This method is 
5 distinct from . both the method of Agrobacterium 
mediated transformation and expression of the seed 
storage, protein phaseolln gene characterised in EP 
122,791. Al. Expression of the transferred phaseolin 
gene in EP 122, 791. Al only demonstrates that the 

lO phaseolin gene family with its particular regulatory 
requirements can be expressed in tobacco and alfal- 
fa. It does not demonstrate nor predict that an,y 
other genes with their particular regulatory re- 
quirements can be expressed in any other plants or 

25 plant tissue . 

An ob j ect of the present invention is to provide a 
possibility of expressing desired genes in plants, 
parts of plants, and plant cell cultures. 

A further object of the invention is to render it 
20 possible to express genes of any origin by the 
control of an inducible root nodule - specif ic pro- 
moter . 

A particular object of the invention is to provide 
a possibility of expressing desired genes in legu- 
25 minous plants. 

A still further particular object of the invention 
is to provide a possibility of expressing root 
nodule - specif ic genes in non- leguminous plants. 

Further objects of the invention are to improve the 
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existing nitrogen- fixing systems in leguminous 
plants as well, as to incorporate nitrogen- fixing 
systems in other plants. 

A further object of the invention is to provide a 
5 possibility of in certain cases allowing the use 
of specific sequences of the 3' flanking region, 
of the coding sequence, and of intervening sequences 
to influence the regulation of the root nodule- 
specific promoter. 

10 Furthermore it is an object of the invention to 
provide plasmids comprising the above mentioned 
inducible plant promoter. 

Further objects of the invention appear immediately 
from the following description. 

15 The method according to the invention far the ex- 
pression of genes in plants, parts of plants, and 
plant cell cultures is carried out by introducing 
into a cell thereof a recombinant DNA segment con- 
taining both the gene to be expressed and a 5' 

20 flanking region comprising a promoter sequence, and 
optionally a 3' flanking region, and culturing of 
the transformed cells in a growth medium, said 
method being characterised by using as the recom- 
binant DNA segment a DNA fragment comprising an 

25 inducible plant promoter (as defined) from root 
nodule - specif ic genes. If desired the transformed 
cells are regenerated to plants. 

The method according ta the invention allows in a 
well defined manner an expression of foreign genes 
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in plants, parts of plants, and plant cell cultures, 
in this connection especially genes providing the 
plants with desired properties such as for instance 
a resistance to plant diseases and increased content 
5 of valuable polypeptides. 

A further use is the preparation of valuable pro- 
ducts such as for instance dyes, flavourings, plant 
hormones, pharmaceutical products, primary and 
secondary metabolites, and polypeptides by means 
10 of the method according to the invention in plant 
cell cultures and plants. 

By using the method according to the invention for 
the expression of root no dule - specif ic genes it is 
possible to express root nodule - specif ic genes 

15 necessary for the formation of an active nitrogen- 
fixing system both in leguminous plants and other 
plants. The correct developmental control, cf. 
Example 8, allows the establishment of a symbiotic 
nitrogen-fixing system in non- leguminous plants. In 

20 this manner it is surprisingly possible to improve 
the existing nitrogen- fixing systems in leguminous 
plants as well as to incorporate nitrogen- fixing 
systems in other plants. 

The use of the method according to the invention 
25 for the expression of foreign genes in root nodules 
renders it possible to provide leguminous plants 
with improved properties such as resistance to 
herbicides and resistance to diseases and pest. 

According to a particular embodiment of the method 
3Qacc rding to the invention a DNA fragment is used 
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which comprises an inducible plant promoter and 
which is identical with, derived from, or comprises 
5' flanking regions of leghemoglobins genes. In this 
manner the expression of- any gene is obtained. 

5 Examples of such DNA fragments are DNA fragments 
of the four 5' flanking regions of the soybean 
leghemoglobin genes, viz. 



Lba with the sequence: 



10 GAGATACATT ATAATAATCT CTCTAGTGTC TATTTATTAT TTTATCTGGT 
GATATATACC TTCTCGTATA CTGTTATTTT TTCAATCTTG TAGATTTACT 
TCTTTTATTT TTATAAAAAA GACTTTATTT TTTTAAAAAA AATAAAGTGA 
ATTTTGAAAA CATGCTCTTT GACAATTTTC TGTTTCCTTT TTCATCATTG 
GGTTAAATCT CATAGTGCCT CTATTCAATA ATTTGGGCTC AATTTAATTA 
GTAGAGTCTA CATAAAATTT ACCTTAATAG TAGAGAATAG AGAGTCTTGG 
AAAGTTGGTT TTTCTCGAGG AAGAAAGGAA ATGTTAAAAA CTGTGATATT 

15 TTTTTTTTGG ATTAATAGTT ATGTTTATAT GAAAACTGAA AATAAATAAA 
CTAACCATAT TAAATTTAGA ACAACACTTC AATTATTTTT TTAATTTGAT 
TAATTAAAAA ATTATTTGAT TAAATTTTTT AAAAGATCGT TGTTTCTTCT 
TCATCATGCT GATTGACACC CTCCACAAGC CAAGAGAAAC AC AT A AG CTT 
TGGTTTTCTC ACTCTCCAAG CCCTCTATAT AAACAAATAT TGGAGTGAAG 
TTGTTGCATA ACTTGCATCG AACAATTAAT AGAAATAACA GAAAATTAAA 
AAAGAAATAT G, 



2Q Lbc^ with the sequence: 



TTCTCTTAAT ACAATGGAGT TTTTGTTGAA CATACATACA TTTAAAAAAA 

AATCTCTAGT GTCTATTTAC CCGGTGAGAA GCCTTCTCGT GTTTTACACA 

CTTTAATATT ATTATATCCT CAACCCCACA AAAAAGAATA CTGTTATATC 

TTTCCAAACC TGTAGATTTA TTTATTTATT TATTTATTTT TACAAAGGAG 

ACTTCAGAAA AGTAATTACA TAAAGATAGT GAACATCATT TTATTTATTA 

TAATAAACTT TAAA AT C AAA CTTTTTTATA TTTTTTGTTA CCCTTTTCAT 

25 TATTGGGTGA AATCTCATAG TGAAGCCATT AAATAATTTG GGCTCAAGTT 

TT ATT AG T A A AGTCTG CATG AAATTTAACT TAACAATAGA GAGAGTTTTC 

GAAAGGGAGC GAATGTTAAA AAGTGTGATA TTATATTTTA TTTCGATTAA 

TAATTATGTT TACATGAAAA CATACAAAAA AATACTTTTA AATTCAGAAT 

AATACTTAAA ATATTTATTT GCTTAATTGA TTAACTGAAA ATTATTTGAT 

TAGGATTTTG AAAAGATCAT TGGCTCTTCG TCATGCCGAT TGACACCCTC 

CACAAGCCAA GAGAAACTTA AGTTGTAAAC TTTCTCACTC CAAGCCTTCT 

AT AT A A A CAT GTATTGGATG TGAAGTTATT GCATAACTTG CATTGAACAA 

30 TAGAAAATAA CAAAAAAAAG TAAAAAAGTA GAAAAGAAAT ATG , 
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Lbc2 with the sequence 



TCGAGTTTTT 
TTTATTCGGC 
ATCCCCACCC 
TTATTTCTTA 
5 ATAGTGAACA 
TTATATTTTT 
ACTATTAAAT 
TTAACTTAAT 
GTGATATTAT 
TTGACAATTT 
TTTAAGATTT 
CTCCACAAGC 
1Q TC TATATAAA 
CAATAGAAAT 



ACTGAACATA 
GAGAAGCCTT 
CC AC CAAAAA 
TTTTTACAAA 
TCATTTTTTT 
TTGTTACCCT 
AGTTTGGGCT 
AATAGAGAGA 
TATAGTTTTA 
ATTTTTAAAA 
TG AAA AG ATC 
CAAG AGAAAC 
CACGTATTGG 
AACAACAAAG 



CATTTATTAA 
CTCGTGCTTT 
AAAAAAAACT 
GGAAACTTCA 
AGTTAAGATG 
TTTCATTATT 
CAAGTTTTAT 
GTTTTGGAAA 
TTTAGATTAA 
TTCAGAGTAA 
ATTTGGCTCT 
TTAAGTTGTA 
ATGTGAAGTT 
AAAATAAGTG 



AAAAAACTCT 
ACACACTTTA 
GTTATATCTT 
CGAAAGTAAT 
AATTTTAAAA 
GGGTGAAATC 
TAGTAAAGTC 
GGTAACGAAT 
TAATTATGTT 
TACTTAAATT 
TCATCATGCC 
ATTTTTCTAA 
GTTGCATAAC 
AAAAAAGAAA 



CTAGTGTCCA 
AT AT T ATT AT 
TCCAGTACAT 
TACAAAAAAG 
TCACACTTTT 
TCATAGTGAA 
TGCATGAAAT 
GTTAGAAAGT 
TACATGAAAA 
ACTTATTTAC 
GATTGACACC 
CTCCAAGCCT 
TTGCATTGAA 
TATG , 



and Lbcj with the sequence: 



TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA 
GTACTATTTA AGAAAAGAAA AAAAAAACCT GCTACATAAT 

15 GTAGATTTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA 
AT AAA A AT AG TGAACATCGT CTAAGCATTT TTATATAAGA 
AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT 
TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT 
TATAAAAAAA ATTGTTTCCC TTTTGATTAT TG GAT A A A AT 
CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT 

2o AATTTTAACT TAAAAATAGA GAAAATCTGG AAA AG GG ACT 
GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT 
CTAAAAAAAT ATATATT A A A ATTTTAAATT CAGAATAATA 
TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA 
TTCACCATAC CAATTGATCA CCCTCCTCCA AC A AG C C A AG 
GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTC TATA 
TTGGATGTGA AGTTGTTGCA TAACTTG CAT TGAACAATTA 

9 q CAGAAAAGTA GAAAAGAAAT ATG • 

A further embodiment of the method according to 
the invention uses a DNA fragment identical with, 
derived from or comprising 5 r flanking regions of 
the Lbc3 - 5 ' - 3 ' - CAT gene with the sequence: 

30 TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 

GTACTATTTA AGAAAAGAAA AAAAAAACCT GCTACATAAT TTCCAATCTT 

GTAGATTTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATTACAAA 

ATAAAAATAG TGAACATCGT CTAAGCATTT TTATATAAGA TCAATTTTAA 

AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AGAGCCATTT 

TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCCAGTTTGA 



ACCAACAAAA 
TTCCAATCTT 
AAATTACAAA 
TGAATTTTAA 
AGAGCCATTT' 
CCGAGTTTGA 
CTC GTAGTG A 
AG TTTG CAT A 
GTTAAAAAGT 
TAT ATG G A AA 
CTT A A ATT AT 
TGATTGTCTC 
AG AG AC AT AA 
TAAATAA GTA 
ATAGAAATAA 
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TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCGT^GTCA 

CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT AGTTTG CAT A 

AATTTTAACT TAAAAATAGA GAAAATCTGG AAAAG GG ACT GTTAAAAAGT 

GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT TAT.- ^GG AAA 

CTAAAAAAAT ATATATTAAA ATTTTAAATT CAGAATAATA CTTAAAT^YT 

5 TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGA^GTC^C 

TTCACCATAC CAATTGATCA CCCTCCTCCA ACA AG CCA AG AGAGACATAA 

GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTC^^ ?ra ^A^AGT" 

TTGGATGTGA AGTTGTTGCA TAACTTGCAT TGAACAA7TA ATAGAAAT AA 

CAGAAAAGTA G AATTCT AAA ATG X ^ 

XO A still further preferred embodiment of the method 
according to the invention uses a DNA fragment 
identical with, derived from or comprising 5' flank- 
ing regions of the N23 gene with the sequence 



lO 20 30 40 SO 60 70 

GAATTCGAGCTCGCCCGGG<^CGATCCTCT 
EcoRX SaLL 

BO 90 lOO 110 120 130 140 

25 ttctattgagacacgatttgaacarittttacattatgagacr 
aaatttaaagctttagatStgatgaatogaa^ 

220 230 240 250 2€Q 270 2BO 

AT GAATGCT ATGAT ATT GAT GGTCTT GATN T ATTNNCAGAATT GAAAGT ATT AAGAGAAGT GTT AAGAAA 

AGAAGTTAGCAC&CCAATAGAAGTATTGAGTTATATTAAAACT 

360 370 380 390 400 410 420 

CATATAGAATTTTATTGACAATC 

430 440 450 460 470 480 490 

20 ACTTAAATGATATCTAAAATCAACAATGTTACAAGATAGATTGAATGA 

500 SIO S20 530 540 5SO 560 

AGTAAACTGlTAGAATTGTrc^^ 

570 580 590 60O. 610 . 620 630 

TAATATAAAAATTGATATTTTATATAATATATTAAGTCT CTTTAAAATTCTT GTAAAAAAAGACATTTTT 

640 650 660 670 6SO 690 700 

AAATAATAAAAXAAAGCAACTCTTAATTTTAAIGAAAC^ 

7lO 720 730 740 750 760 770 

AAJU^TTAATGGTTGATGGAAGTTTTTAATTTGTTCTACT 

780 790 BOO 810 820 830 840 

25 TATCATTTATATGTTGTAAATATGAATGCACTAGTAATTAGTTTAATGAT 
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850 860 870 880 890 900 910 

ATTTCTGTCTCTTGGCAACTCGTGAGAAra 

920 930 940 950 960 970 980 

AGAATAAATATTTATATACAATTCCTAGATTTTGTTATAAAATTC^ 

990 100O 1010 1020 1030 1040 10 SO 

GAGCACACACCAAACTAGTCTCAAATTAAGTAAGGTGCTAATTATTAGCGGCTAG CTAAG TAACCAAGTA 



ATTAAT6 

5 In a particularly preferred embodiment of the method 
according to the invention a 3' flanking region of 
root nodule -specific genes is furthermore used , in 
particular sequences of the 3' flanking region ca- 
pable of influencing the activity or regulation of 
lO a promoter of the root nodule - spec if ic genes or 
the transcription termination, or capable of in- 
fluencing the yield of the desired gene product in 
another manner . 

Examples of such 3' flanking regions are the four 
15 3' flanking regions of the soybean leghemoglob in 
genes, viz. 

Lba with the sequence: 



1590 1620 
TAA TTA GTA TCT* ATT GCA GTA AAG TGT AAT AAA TAA ATC TTG 

16 SO 1680 
TTT CAC TAT AAA ACT TGT TAC TAT TAG ACA AGG GCC TGA TAC AAA ATG TTG GTT AAA ATA 

1710 1740 
20 ATG GAA TTA TAT ACT ATT GGA TAA AAA TCT TAA GGT TAA TAT TCT ATA TTT GCG TAG GTT 

1770 1800 
TAT GCT TGT GAA TCA TTA TCG GTA TTT TTT TTC CTT TCT GAT AAT TAA TCG GTA AAT TA 

1830 1860 
ACA AAT AAG TTC AAA ATG ATT TAT ATG TTT CAA AAT TAT TTT AAC AGC AGG TAA AAT GTT 



ATT TGG TAC GAA AGC TAA TTC GTC GA 
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Lbc^ with the sequence: 



1320 

TAA/TT AG6 ATC TAC TGC ATT GCC GTA 



1350 1380 
AAG TGT AAT AAA TAA ATC TTG TTT CAA CTA AAA CTT GTT ATT AAA CAA GTT CCC TAT ATA 

1410 1440 
AAT GTT GTT TAA AAT AAG TAA ATT TGA TTG TAT TGG ATA AAC ACT TTT AAG TTA TAT ATT 

1470 1500 
5 TCC ATA TAT TTA CGT TTG TGA ATC ATA ATC GAT ACT TTA TAA AAA TAA ATT CCA AAT AAT 



TTA TAC GTT TTA AAA ATT ATT TT 



Lbc2 with the sequence: 



TAG/GAT CTA CTA TTG CCG TCA AGT 

X140 

GTA ATA AAT AAA TTT TGT TTC ACT AAA ACT TGT TAT TAA ACA AGT CCC CGA TAT ATA AAT 

1170 12 OO 

lO GTT GOT TAA AAT AAG TAA ATT ATA CGG TAT TGA TAA ACA ATC TTA AGT TTT ATA TAT AGT 

1230 12 60 

TCC ATA TAC TAA AGT TTG TGA ATC ATA ATC GA 

1290 



and Lbc3 with the sequence; 



TAG/GAT CTA CAA TTG CCT TAA AGT GTA ATA AAT AAA 
990 1020 

TAT TAT TTC ACT AAA ACT TGT TAT TAA ACC AAG TTC TCG ATA TAA ATG TTG GTT AAA CTA 

lOSO 1080 

TC AGT AAA TTA TAT GGT ATT GGA TAA ACA ATC TTA AGC TT 

1110 



This sequence is positioned on the 0.9 Kb 3' flan- 
king region used according to the invention. A 
particular embodiment of the invention is therefore 
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the use of sequences of this region exerting or 
mediating the regulation characterised by the in- 
vention of root nodule-specific promoter regions. 

In a preferred embodiment of the method according 
5 to the invention a region is used of the coding 
sequence or intervening sequence of root nodule- 
specific genes, in particular sequences of the 
coding sequence or the intervening sequence capable 
of influencing the regulation of a promotor of the 
lO root nodule - specif ic genes or capable of influenc- 
ing the yield of the desired gene product in another 
manner ♦ 



Examples of such coding sequences and intervening 
sequences are the four leghemoglobin genes of soy- 
15 bean , viz . 



Lba with the sequence: 120 

VAL 
ATG/GTT 



ISO 



ieo 



ALA 


PHE 


THR GLU 


GCT 


TTC 


ACT 


GAG 


ILE 


PRO 


GLN 


TYR 


ATT 


CCT 


CAA 


TAC 


CCA 


TTC 


TAT 


GTT 


GAG 


TGG 


TTT 


TGG 


LEU 


PHE 


SER PHE 


TTG 


TTC 


TCA 


TTT 


GLU 


LYS 


LEU 


PHE 


GAA 


AAG 


CTT 


TTT 


TTA 


ATT 


TTA 


AGA 


TGT 


TTG 


AAT 


TGT 


TAT 


TAG 


TAT 


TTG 



210 *«° 

TYR THR SER 

TAC ACT TC/G TAA GTT TTC TCT CTA AGC ATG TGT CTT 

270 300 
ATT TGT TGT GTT TGA AAA AAG ATA TAT TGT TAA TGT 

33O 360 

ILE LEU GLU LYS ALA PRO ALA ALA LYS ASP 
TGA ATAG/G ATA CTG GAG AAA GCA CCT GCA GCA AAG GAC 

390 420 
VAL ASP PRO THR ASN PRO LYS LEG THR GLY HIS ALA 
STA GAC CCC ACT AAT. CCT AAC CTC ACG GGC CAT GCT 

450 480 

TCA CCC AAC TAA AAT TAT AAC TAT TTT ATG TGA 

510 540 
TAT TTT AAC. ACT CTT AAA ACA TCA ATG AAC ATT AAT 

570 6 OO 

25 

630 660 
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690 720 
VAL ARG ASP SER ALA GLY GLN LEU LYS ALA SER GLY THR VAL VAL ALA 
TTT TGA ATT GTAG/GTG CGT GAC TCA GCT GGT CAA CTT AAA GCA ACT GGA ACA GTG GTG GCT 

750 78o 
ASP ALA ALA LED GLY SER VAL HIS ALA GLN LYS ALA VAL THR ASP PRO GLN PHE VAL 
GAT GCC GCA CTT GGT TCT GTT CAT GCC CAA AAA GCA GTC ACT GAT CCT CAG TTC GTG/GT 

810 840 
ATG ATA AAT AAT GAA ATG TTA TAA TAA ATT ATG CAT ACT TCA ATT TTT CAT GGA GCA GTA 

870 900 
TAA TGA TCA ACA CAC ACT TCT TTT GTT TCA TGC ATT TGA TAA CTA CAA TCT TAA AAT GTT 

930 960 
5 GCA ATC TTA AAA ATA GTA TTA AAA ATA TAA CAT TTA ATT AGC TCA TCA ATA TTT TTC TGT 

990 1020 
TGC AAT TTT TTA TGA AAA AAT TAT AAT TAT GAA TTC TTT GAG CAA TGT TTA ATT AAA AAA 

10S0 1080 
TTG ATT TAA TAA TGA AAT AAC TAA GCT ACC TCT GTC TCG TTT TTC ATT TAA ACT ATG ACA 

1110 1140 
TAA ACA ATG AAT AAA GTA AAC TAA ACC ATG ACA TGT TTA TTT TTG AAT GAG GTT ATT AAT 

1170 120O 
AAT TTT TTT TCA CTA TCT ATT GCA ATG TTC ATT GAT TAT CAA TTA TCT TGG TTG CAT TGA 

1230 1-60 
10 TTC TCT CGA TTT TTT TCT TGA GGT TAA GCT TCA GTT CAA TAT ATA TTC. ATT TTT TGA TAA 

1290 1320 
AAA AAA ATA GTA CAA TAT ATT TTC ATT TAG CTG ATC ATA TTT ATT TAA GTT CAA CTT AAA 

1350 1380 
ATT TTA TAG ATG TTA ATT GAT ATA ATT TGT TGA GAT GAT GAG AAG ACC AAT ACC ATT ACG 

1410 14 40 

TAC TCT TTT GAA AGT GTT ATA TGG ATT TTA ATT ATA AGG.AAA AAT GTA AGA GCT AAA CCA 

1470 1500 
VAL VAL LYS GLU ALA LEU LEO LYS THR ILE LYS ALA ALA VAL 
15 TTG CTG ATG ATT TTG AAG/GTG GTT AAA GAA GCA CTG CTG AAA ACA ATA AAG GCA GCA GTT 

1530 1560 
GLY ASP LYS TRP SER ASP GLU LEU SER ARG ALA TRP GLU VAL ALA TYR ASP GLU LEU ALA 
GGG GAC AAA TGG AGT GAC GAG TTG AGC CGT GCT TGG GAA GTA GCC TAC GAT GAA TTG GCA 



ALA ALA ILE LYS LYS ALA 
GCA GCT ATT AAG AAG GCA TAA 



The amino acid sequence of the Lba protein is in- 
2Q dicated above the coding sequence, 



Lbc^ with the 



sequence : 
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ISO 
GLY 
ATG/GCT 

210 240 
ALA PHE THR GLU LYS GLN GLU ALA LEU VAL SER SER SER PHE GLU ALA PHE LYS ALA ASN 
GCT TTC ACT GAG AAG CAA GAG GCT TTG GTG AGT AGC TCA TTC GAA GCA TTC AAG GCA AAC 

270 300 
ILE PRO GLN TYR SER VAL VAL PHE TYR ASN SER 

ATT CCT CAA TAC AGC GTT GTG TTC TAG AAT TC/GTAA GTT TTC TCT ATA AGC ATG TGT CTT 

330 360 
TCA TTC -TAT GTT TTT CTT -CTG GAA ATT TTT TGT GTT TGA AAA AAG ATA TAT ATA TAT ATA 

390 420 
5 TAT ATA TAT ATA TAT ATA TAT ATA TAT ATA TAT ATA TAT TTT GTT AAT GTG AGT GGT TTT 

450 480 

ILE LEU GLU LYS ALA PRO ALA ALA LYS ASP LEU PHE SER 
GGT TTG ATT AAA AAT AAA TAG/GATT CTG GAG AAA GCA CCT GCA GCA AAG GAC TTG TTC TCA 

SIO 540 
PHE LEU ALA ASN GLY VAL ASP PRO THR ASN PRO LYS LEU THR GLY HIS ALA GLU LYS LEU 
TTT CTA GCA AAT GGA GTA GAC CCC ACT AAT CCT AAG CTC ACG GGC CAT GCT GAA AAG CTT 

S70 6CO 

PHE ALA LEU 

TTT GCA TTG/GT AAG TAT CAG CCA ACT AAA ATT ATA ACT ATT TTA TGT GAT TAA TTT TAA 

630 660 
GAT TAA ACA TCA TGT ATT TTA ACA CTC TTA AAA TAT CAA TGA ACA TTA ATT TTT TGA ATT 

690 720 
lO GTA TTT TAT ATT TTT ACC ATA TCT TGA ACT AGG AAT AAT ATA TAA ATT TCT ATT AGT ATT 

750 780 
TGT TGG TAA TTA CAT ATA TAT ATA TAT ATA TAA TCC TTG TGA TAA TTA TTT TTC GAA TTT 

810 840 
VAL ARG ASP SER ALA GLY GLN LEU LYS THR ASN GLY THR VAL VAL ALA ASP ALA ALA 
GTAG/GTG CGT GAC TCA GCT GGT CAA CTT AAA ACA AAT GGA ACA GTG GTG GCT GAT GCT GCA 

870 900 

LEU VAL SER ILE HIS ALA GLN LYS ALA VAL THR ASP PRO GLN PHE VAL 

CTT GTT TCT ATC CAT GCC CAA AAA GCA GTC ACT GAT CCT CAG TTC GTG/GT ATG ATA AAT 

930 96b 
AAT ACT AGT AAA ATG TTA CAA TAA ATG CAA ACT TAA GTT TTA CGT ACA TAG TGA TCA TGA 

990 1020 
15 CTT CAT GCA TGG CTA TTA TTT TTT CAT ATT TAT TGA AGT CAA CTT AAA ATT TTG TAA ATA 

1050 1O80 
CAG ATC GAT GCT AGT AAT TTG TTG AGA TCA TGA GAA AAC GTA CCA CTA CTC CAA TAG CAT 

1110 1140 
TAC TCA TTT TGA AAA TTG TAT AAC TGT GAT CTA ATT ATA AGG AAA AAG TGT ATA TAA GAG 

1170 1200 
VAL VAL LYS* GLU ALA LEU LEU LYS THR 
CTA ATC CAT TAT TAA TGT TTT TTA TAT TTT GTAG/GTG GTT AAA GAA GCA CTG CTG AAA ACA 

1230 1260 
ILE LYS GLU ALA VAL GLY GLY ASN TRP SER ASP GLU LEU SER SER ALA TRP GLU VAL ALA 
ATA AAG GAA GCT GTT GGC GGC AAT TGG AGT GAC GAA TTG AGC AGT GCT TGG GAA GTA GCC 

1290 

TYR ASP GLU LEU ALA ALA ALA ILE LYS LYS ALA 
20 TAT GAT GAA TTG GCA GCA GCA ATT AAA AAG GCA TAA 



The amino acid sequence of the Lbc^ protein is 
indicated above the coding sequence, 
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Lbc2 with, the sequence: 



GLY 
G/GGT 
180 

ALA PHE THR GLU LYS GLN GXU ALA LEU VAL SER SER SER PHE GLU ALA PHE LYS ALA ASH 
GCT TTC ACT GAG AAG CAA GAG GCT TTG GTG AGT AGC TCA TTC GAA GCA TTC AAG GCA AAC 

210 240 

XLE PRO GLN TYR SER VAL VAL PRE TYR THR SER 

ATT CCT CAA TAC AGC GTT GTG TTC TAC ACT TC/GTA AGT TTT CTC TTA AAG CAT GTA TCT 

270 300 

5 TTC ATT CTC TGT TTT TCC TTT CGA CAT TTT TTG TGT TTG AAA AGA GAT AGT GTC AAT GTG 

330 360 

XLE LEU GLU LYS ALA PRO ALA ALA LYS 
AGT GGG TAT TTT TTT TTA TTA AAA ATT AAC AG/G ATA CTG GAG AAA GCA CCC GCA GCA AAG 

390 420 

ASP LEU PHE SER PRE LEU SER ASK GLY VAL ASP PRO SER ASH PRO LYS LEU THR GLY HIS 
GAC TTG TTC TCG TTT CTA TCT AAT GGA GTA GAT CCT AGT AAT CCT AAG CTC ACG GGC CAT 

4S0 480 

ALA GLU LYS LEU PHE GLY LEU 

GCT GAA AAG CTT TTT GGA TTG/GTA AGT ATC ATC CAA CTA AAA TTA TAG CTA TTT TAT GTG 

5lO 540 

lO ATT AAT TTT AAG ATT AAA CAT GTA TTT AAC ACT CTT AAA CAT GTA TTT AAC ACT CTT AAG 

570 6CO 

ATT AAA CAT GTA TTT AAC TAA AAC ATG TAT TTG CTG ATT ATT TTT TTT TTA TAA TTA TCT 

630 660 

VAL ARG ASP SER ALA GLY GLN LEU LYS ALA 
TGT CAC ATA TTA TAT ATT TTT TGA ATT GTA G/GTG CGT GAC TCA GCT GGT CAA CTT AAA GCA 

690 . 720 

a! 2 22 12? SS* m W SER HIS ALA GLN LYS ALA ILE THR 

AAT GGA ACA GTA GTG GCT GAT GCC GCA CTT GGT TCT ATC CAT GCC CAA AAA GCA ATC ACT 

750 780 

15 259 GLN PHE VAL 

CCT GT^/GT ATG ATA AAT AAT AAA ATG TTA CAA TAA ATG CAC ATA TAC TTA 

810 840 
AAT TTT ACA TGG TGC AGT GTT ATG ATC ATC ATT TTT GTT TAG TAA TGA ATT TAC TTA AAA 

870 900 

TCT TAA ATT ATG TAC TTT TTG AAA GTT TTA TAT GGA ATT TTA ATT ATA GGG AAA AAT GTA 

930 960 

AGA GCT AAT CCA TTA GTG ATG TTT TGT CTG T A G/GTG GTT AAA GAA GCA CTG CTG AAA ACA 



990 



1020 



LE LYS GLU ALA VAL GLY ASP LYS TRP SER ASP GLU LEU SER SER ALA TRP GLU VAL ALA 
ATA AAG GAG GCA GTT GGG GAC AAA TGG AGT GAT GAA TTG AGC AGT GCT TGG GAA GTA GCC 



10SO 

20*** ASP GLU LEU ALA ALA ALA XLE LYS LYS ALA PHE 

TAT GAT GAA TTG GCA GCA GCT ATT AAG AAG GCA TTT TAC 

lllO 



1080 
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The amino acid sequence of the Lbc£ protein is 
indicated above the coding sequence, 

and Lbc3 with the sequence: 

GLY ALA PHE THR ASP 
C/GGT GCT TTC ACT GAT 
' 120 

LYS GLN GLU ALA LEU VAL SER SER SER PHE GLU ALA PHE LYS THR ASN ILE PRO GLN TYR 
£ AAG CAA GAG GCT TTG GTG ACT AGC TCA TTT GAA GCA TTC AAG ACA AAC ATT CCT CAA TAC 
° 150 180 

SER VAL VAL PHE TYR THR SER 

AGT GTT GTG TTC TAC ACC TC/GTA AGT ATT CTA TCT AAA TTA TGT GTC TTA TTG TAT GTT 

210 240 

TAA CTT TCG TGG TTT GTT GTG TTT GAA AAA AAG ATA TAT ATT GTT AAT GTG AGT GGT TTT 

270 30O 

ILE LEO GLU LYS ALA PRO VAL ALA LYS ASP LEU PHE SER 
GGT TTG ACT AAA AAT GAA TAG/G ATA CTG GAG AAA GCA CCT GTA GCA AAG GAC TTG TTC TCA 

330 360 

TO PHE LEU ALA ASN GLY VAL ASP PRO THR ASN PRO LYS LEU THR GLY HIS ALA GLU LYS LEU 
TTT CTA GCT AAT GGA GTA GAC CCC ACT AAT CCT AAG CTC ACG GGC CAT GCT GAA AAA CTT 

390 420 

PHE GLY LEU 

TTT GGA TTG/GT AAG TAT CCA GCC TAC TAA AAT TAA AAT CCT ATT AGT ATT TTT TAT TAT 

450 480 



VAL ARG ASP SER 

TTT TCT TCC ATG ATT GTC TTG TCA CAT ATT ATA TAT TTT TTG AAT TAT AG/ GTA CGT GAT TCA 

SlO 540 

ALA GLY GLN LEU LYS ALA SER GLY THR VAL VAL ILE ASP ALA ALA LEU GLY SER ILE HIS • 
GCT GGT CAA CTT AAA GCA AGT GGA ACA GTG GTG ATT GAT GCC GCA CTT GGT TCT ATC CAT 

570 600 

15 ALA GLN LYS ALA ILE THR ASP PRO GLN PHE VAL 

GCC CAA AAA GCA ATC ACT GAT CCT CAA TTT GTG/G TAT GAT AAA TAA TGA AAA GCT ACA 

630 660 



ATA AAT GCA CAA ATA CTT AAT TTT ACA TAG TGC AGT GCT ATA TGA TCA TCA CTT TTG CTT 

690 720 

AGT AAT GAA TTT ACT TTT TTT TTT TAC AGA AGT AAT GGA TTT ACT TAA AAT CTT AAA TTA 

7SO 760 

TGT ACT TCT TTA AAG AGT TTT GTA TGG AAT TTT AAT TAT AGG AAA AAT GTA AGA GCT AAA 

BIO 840 

VAL VAL LYS GLU ALA LEU LEU LYS THR ILE LYS GLU ALA 
CCA TTG CTG ATG ATT TCG AAG/GTG GTT AAA GAA GCA CTG CTG "AAA ACA ATA AAG GAG GCA 

870 900 

20 VAL GLY ASP LYS TRP SER ASP GLU LEU SER SER ALA TRP GLU VAL ALA TYR ASP GLU LEU 
GTT GGG GAC AAA TGG AGT GAC GAG TTG AGC AGT GCT TGG GAA GTA GCC TAT GAT GAA TTG 

930 „ 960 



ALA ALA ALA ILE LYS LYS ALA PHE 
GCA GCA GCT- ATT AAG AAG GCA TTT TAG 
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The amino acid sequence of the LDC3 protein is 
indicated above the coding sequence. 

The present invention furthermore deals with a 
novel DNA fragment comprising an inducible plant 
5 promoter to be used when carrying out the method 
according to the invention, said DNA fragment being 
characterised by being identical with, derived 
from or comprising a 5' flanking region of root 
nodule -specific genes. Examples of such DNA 

10 fragments are DNA fragments being identical with, 
derived from or comprising a 5' flanking region of 
plant leghemoglobin genes. Preferred examples are 
according to the invention DNA fragments being 
identical with r derived from or comprising a 5' 

15 flanking region of the four soybean leghemoglobin 
genes , viz . : 

Lb a with the sequence: 



GAGATACATT 
GATATATACC 

>q TCTTTTATTT 
ATTTTGAAAA 
GGTTAAATCT 
GTAGAGTCTA 
AAAGTTGGTT 
TTTTTTTTGG 

25 CTAACCATAT 
TAATTAAAAA 
TCATCATGCT 
TGGTTTTCTC 
TTGTTG CATA 
AAAGAAATAT 



ATAATAATCT 

TTCTCGTATA 

TTATAAAAAA 
CATGCTCTTT 

CAT AGTG CCT 
CATAAAATTT 
TTTCTCGAGG 
ATTAATAGTT 
TAAATTTAGA 
ATTATTTGAT 
GATTGACACC 
ACTCTCCAAG 
ACTTGCATCG 
G, 



CTCTAGTGTC 
CTGTTATTTT 
GACTTTATTT 
GACAATTTTC 
CTATTCAATA 
ACCTTAATAG 
AAGAAAGGAA 
ATGTTTATAT 
ACAACACTTC 
TAAATTTTTT 
CTCCACAAGC 
CCCTC TATAT 
AACAATTAAT 



TATTTATTAT 
TTCAATCTTG 
TTTTAAAAAA 
TGTTTCCTTT 
ATTTGGGCTC 
TAGAGAATAG 
ATGTTAAAAA 
GAAAACTGAA 
AATTATTTTT 
AAAAGATCGT 
CAAG AGAAAC 
AAACAAAT AT 
AGAAATAACA 



TTTATCTGGT 
TAGATTTACT 
AATAAAGTGA 
TTCATCATTG 
AATTTAATTA 
AGAGTCTTGG 
CTGTGATATT 
AATAAATAAA 
TTAATTTGAT 
TGTTTCTTCT 
AC AT AAG CTT 
TG G AGTG A AG 
GAAAATTAAA 
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Lbc^ with the sequence 



TTC7CTTAAT ACAATGGAGT TTTTGTTGAA CATACATACA TTTAAAAAAA 
AATCTCTAGT GTCTATTTAC CCGGTGAGAA GCCTTCTCGT GTTTTACACA 
CTTTAATATT ATTATATCCT CAACCCCACA AAAAAGAATA CTGTTATATC 
TTTCCAAACC TGTAGATTTA TTTATTTATT TATTTATTTT TACAAAGGAG 
ACTTCAGAAA AGTAATTACA TA A AG AT AG T GAACATCATT TTATTTATTA 
TAATAAACTT TAAAATCAAA CTTTTTTATA TTTTTTGTTA CCCTTTTCAT 
TATTGGGTGA AATCTCATAG TG A AG CC ATT AAATAATTTG GGCTCAAGTT 
TTATTAGTAA AGTCTGCATG AAATTTAACT TAACAATAGA GAGAGTTTTC 
GAAAGGGAGC GAATGTTAAA AAGTGTGATA TTATATTTTA TTTCGATTAA 
TAATTATGTT TACATGAAAA CATACAAAAA AATACTTTTA AATTCAGAAT 
AATACTTAAA ATATTTATTT GCTTAATTGA TTAACTGAAA ATTATTTGAT 
TAGGATTTTG AAAAGATCAT TGGCTCTTCG TCATG CCG AT TGACACCCTC 
CACAAGCCAA GAG A A ACT T A AGTTGTAAAC TTTCTCACTC CAAGCCTTCT 
ATATAAA CAT GTATTGGATG TGAAGTTATT GCATAACTTG CATTGAACAA 
TAG AAA AT AA CAAAAAAAAG TAAAAAAGTA GAAAAGAAAT ATG, 



Lbc2 with the sequence: 



TCGAGTTTTT 
TTTATTCGGC 
ATCCCCACCC 
TT AT^TC^T A 
ATAGTGAACA 
TTATATTTTT 
ACTATTAAAT 
TTAACTTAAT 
G T GAT ATT AT 
TTGACAATTT 
TTTAAGATTT 
"CTCCACAAGC 
TC TATATAAA 
CAATAGAAAT 



ACTGAACATA 
GAGAAGCCTT 
CC AC C A AA AA 
TTTTTACAAA 
TCATTTTTTT 
TTGTTACCCT 
AGTTTGGGCT 
AATAGAGAGA 
TATAGTTTTA 
ATTTTTAAAA 
TGAAAAGATC 
CAAG AGAAAC 
CACGTATTGG 
AACAACAAAG 



CATTTATTAA 
CTCGTGCTTT 
AAAAAAAACT 
GGAAACTTCA 
AGTTAAGATG 
TTTCATTATT 
CAAGTTTTAT 
GTTTTGGAAA 
TTTAGATTAA 
TTC AG AG T A A 
ATTTGGCTCT 
TTAAGTTGTA 
ATGTGAAGTT 
AAAATAAGTG 



AAAAAACTCT 
ACACACTTTA 
GTTATATCTT 
CGAAAGTAAT 
AATTTTAAAA 
GGGTGAAATC 
TAGTAAAGTC 
GGTAACGAAT 
TAATTATGTT 
TACTTAAATT 
TCATCATGCC 
ATTTTTCTAA 
GTTG CATAAC 
AAAAAAGAAA 



CTAGTGTCCAi 

atatta jtat ! 
tccagtacat: 
tacaaaaaag: 
tcacactttt: 
tcatagtgaa: 
tgcatgaaat. 
gttagaaagt . 

TACATGAAAA 
ACTTATTTAC 
GATTGACACC 
CTCCAAGCCT 
TTG CATTG A A 
TATG 9 



and Lbc3 with the sequence: 
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TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 
GTACTATTTA AGAAAAGAAA AAAAAAACCT GCTACATAAT TTCCAATCTT 
GTAGATTTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATTACAAA 
ATAAAAATAG TGAACATCGT CTAAGCATTT TTATAT AAGA TGAATTTTAA 
AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AGAGCCATTT 
5 TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGTTTGA 
TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCG7AGTGA 
CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT AGTTTGCATA 
AATTTTAACT TAAAAATAGA GAAAATCTGG AAAAGGGACT GTTAAAAAGT 
GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT TATATGGAAA 
CTAAAAAAAT ATATATTAAA ATTTTAAATT CAGAATAATA CTTAAATTAT 
TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGATTGTCTC 
TTCACCATAC CAATTGATCA CCCTCCTCCA ACAAG CCAAG AGAGACATAA 
10 GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTCTATA TAAATAAGTA 
TTGGATGTGA AGTTGTTGCA TAACTTGCAT TGAACAATTA ATAGAAATAA 
CAGAAAAGTA GAAAAGAAAT ATG* 



Another example of a preferred DNA fragment accord- 
ing to the invention is a DNA fragment which is 
15 identical with, derived from or comprises 5' flank- 
ing regions of the Lbc 3 - 5 ' - 3 ' CAT gene with the 
sequence 



TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 
GTACTATTTA AGAAAAGAAA AAAAAAACCT GCTACATAAT TTCCAATCTT 
GTAGATTTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATTACAAA 
20 ATAAAAATAG TGAACATCGT CTAAGCATTT TTATATAAGA TGAATTTTAA 
AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AGAGCCATTT 
TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGTTTGA 
TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCGTAGTGA 
CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT AGTTTGCATA 
AATTTTAACT TAAAAATAGA GAAAATCTGG AAAAGGGACT GTTAAAAAGT 
GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT TATATGGAAA 
25 CTAAAAAAAT ATATATTAAA ATTTTAAATT CAGAATAATA CTTAAATTAT 
TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGATTGTC~C 
TTCACCATAC CAATTGATCA CCCTCCTCCA ACAAG CCAAG AGAGACATAA 
GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTC TATA ^AA^AAGTA 
TTGGATGTGA AGTTGTTGCA TAACTTGCAT TGAACAATTA ATAGAAATAA 
CAGAAAAGTA GAATTCTAAA ATG 



30 Still another example of such a DNA fragment ac- 
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cording to the invention is a DNA fragment which 
is identical with, derived from or comprises 5' 
flanking regions of the N23 gene with the sequence 



10 20 30 40 50 60 70 

GAATTC GAGCTCGCCCGGGGATCGATCCTCTAGA XSTC 
ECORI Sail 

80 90 lOO HO 12 Q 130 140 

5 TTCTATTGAGACACGJVTTTGAACAATTTTTACAT^ 

150 160 170 180 290 200 210 

AAATTTAAAGCTTTAGATGATGATGAATTGAANNAATAOTGTATTAAT 

220 230 240 250 260 270 280 

ATGAATGCTATGATATTGATGGTCTTGATNTATTNNCAGAATTG^ 

290 30O 310 320 330 340 ^35Q 

10 AGAAGTTAGCACACCAATAGAAGTATTGAGTTATATTAAAACTTTAGATTCTTTTCAAATGTTTACATTG 

3 60 370 380 390 4O0 410 420 

CATATAGAATTTTATTGACAATCCTTATAACAGTTGCTACT 

430 440 4SO 460 470 480 490 

ACTTAAATCATATCTAAAATCAACAATGTTACAAGATAGA 

500 510 520 530 540 550 560 

2JS AGTAAAGTGOTAGAATTGTTTGATTATAAAACTCTGATAAATGATTTTGCA 

570 580 590 600 610 620 630 

TAATATAAAAATTGATATTTTATATAATATATTAAGTCT 

640 6SO 6 60 670 680 690 7O0 

AAAT AAT AAAAT AAAGCAACTCTTAATTTT AATGAAACATCC CTTTGTTAAACC GT 



20 



710 720 730 740 7SO 760 770 

AAAAATTAATGCTTGATGGAAGTTTTTAATTTGTTCTACTCIAATACT 

780 790 800 810 820 830 840 

TATCATTTATATGTTGTAAATATGAATGCACTAGTAATTAGTTTAATGATAAAATATAa?TCTAC^GATAT 

850 860 870 880 890 900 910 

ATTTCTGTCTCTTGGCAACTCGTGAGAATTGAATATA 

920 930 940 950 960 970 - 980 

2 5 AGAATAAATATTTATATACAATTCCTAGATTTTGTTATAAAATTCACATATTGTA 

990 100O 1010 1020 10 3 O 1040 1050 

gagcacacaccaaactagtctcaaattaagtaaggtgctaattattagcggctagctaagtaaccaagta 

ATTAATG 
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The invention relates furthermore to any plasmid 
to be. used when carrying out the method according 
to the invention and characterised by comprising a 
DNA fragment containing an inducible plant promoter 
5 as herein defined. Particular examples of suitable 
plasmids according to the invention are pARll, 
pAR29, pAR30 f and N23-CAT, cf. Examples 3, 4, and 
11- These plasmids allow recombination into the A , 
rhizogenes T DNA region. 

10 Tne invention relates furthermore to any Agrobac- 
terium strain to be used in connection with the 
invention and characterised by comprising a DNA 
fragment comprising an inducible plant promoter of 
root nodule-specific genes built into the T DNA 

15 region and therefore capable of transforming the 
inducible promoter into plants. Particular examples 
of bacterium strains according to the invention are 
the A. rhizogenes strains AR1127 carrying pAR29 , 
AR1134 carrying pAR30 f AR1000 carrying pARll, and 

20 AR204-N23-CAT carrying N23-CAT. 

It is obvious that the patent protection of the 
present invention is not limited by the embodiments 
stated above. 

Th u s t h e i nve n t i on employs not exclusively 5 ' flan- 
25 king regions of soybean leghemoglobin genes . It is 
well-known that the leghemoglobin genes of all 
leguminous plants have the same, function, cf. Apple- 
by (1974) in The Biology of Nitrogen Fixation, 
Quispel. A. Ed. Nor th -Ho Hand Publishing Company, 
30 Amsterdam, Oxford, pages 499-554, and concerning the 
kidney bean PvLbl gene it has furthermore been 
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proved that a high degree of homogoly exists with 
the sequences of the soybean L.DC3 gene. It is also 
known that the expression of other root nodule- 
specific genes is regulated in a similar manner 
5 like the leghemo glob in genes . The invention includes 
thus the use of 5' flanking regions of leghemo glob in 
genes or other root nodule - spec if ic genes of all 
plants in case the use of such DNA fragments makes 
the expression of a desired gene product the subject 
10 matt «^ of the regulation characterised by the pre- 
sent invention. 

The present invention allows also the use of such 
fragments of any origin which under natural con- 
ditions exert or mediate the regulation charac- 
]5 terised by the present invention. The latter applies 
especially to such fragments which can be isolated 
from DNA fragments from gene libraries or genomes 
through hybridization with labelled sequences of 5' 
flanking regions of soybean leghemoglobin genes. 

2o It: is well-known that it is possible to alter nuc- 
leotide sequences of non- important sub-regions of 
5' flanking regions without causing an alteration 
of the promoter activity and the regulation. It is 
also well-known that an alteration of sequences of 

25 important subregions of 5' flanking regions renders 
it possible to alter the binding affinities between 
nucleotide sequences and the factors or effector 
substances necessary or responsible for the trans- 
cription initation and the translation initiation 

3Qand consequently to improve the promoter activity 
and/or the regulation. The present invention in- 
cludes, of course, also the use of DNA fragments 
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containing such altered sequences of 5 ' flanking 
regions, and in particular DNA fragments can be 
mentioned which have been produced by recombining 
sequences of 5' flanking regions of any gene with 
5 5' flanking regions of root nodule- specif ic genes 
provided the use of such DNA fragments subjects 
the expression of a desired gene product to the 
regulation characterised by the present invention. 

It should be noted that the transformation of micro- 
lO organisms is carried out in a manner known per se, 
cf. e.g. Maniatis et al., (1982), Kolecular Cloning, 
A Laboratory Manual, Cold Spring Harbor Laboratory. 

The transformation of plant cells, i.e. introduction 
of plasmid DNA into plant cells, is also carried 
15 out in a manner known per se, cf. Zambryski et 
al., (1983), EMBO J. 2., 2143-2150. 

Cleavage with restriction endonucleases and di- 
gestion with other DNA modifying enzymes are well- 
known techniques and are carried out as recommended 
20 by the suppliers. 

The Aeroba cterium rhizo genes 15834 rif R was used 
as a typical representative of A. rhizbgenes : see 
White et al., I.Bact., Vol. 141 (1980), 1134-1141. 

Example 1 

25 Sequence determination of 5' flanking regions of 
soybean leghemoglobin genes 



From a soybean gene library the four soybean leg- 
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hemoglobin genes Lba, Lbc^, LbC2» and LbC3 are 
provided as described by Jensen, E.0. et al. , Nature 
Vol. 291, No. 3817, 677-679 (1981). The genetically 
stable in-bred invariable soybean species "Glycine 
5 max. var. Evans" was used as a starting material for 
the isolation of the DNA used for the construction 
of said gene library. The 5' flanking regions of 
the four soybean leghemoglobin genes are isolated, 
as described by Jensen, E.0., Ph D Thesis, Institut 
1Q f or Molekylar Biologi, Arhus Universitet (1985), 
and the DNA sequences determined by the use of the 
dideoxy method as described by Sanger, F . , J. Mol . 
Bio. 143, 161-178 (1980) and indicated in the se- 
quence scheme. 

35 Example 2 

Construction of Lbc 3 - 5 ' - 3 ' -CAT 

The construction has been carried out in a sequence 
of process steps as described below: 

a) Sub-cloning the Lbc ? gene 

20 The Lbc 3 gene was isolated on a 12Kb EcoRI restric- 
tion fragment from a soybean DNA library, which 
has been described by Wiborg et al . , in Nucl. Acids 
Res. (1982) 10, 3487. A section of the fragment is 
shown at the top of the attached Scheme 2. This 

25 fragment was digested by the enzymes stated and 
then ligated to pBR322 as indicated at the Scheme. 
The resulting plasmids Lbc 3 HH and Lbc 3 HX were sub- 
sequently digested by PvuII and religated, which 
resulted in two plasmids called pLpHH and pLpHX. 
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b) Sub -cloning 5 'flanking sequences from the Lbc 3 

gene 

For this purpose pLpHH was used as shown in the 
attached Scheme 3. This plasmid was opened by means 
5 of PyuII and treated with exonuclease Bal31. The 
reaction was stopped at various times and the 
shortened plasmids were ligated into fragments from 
pBR322. These fragments had been treated in advance 
as shown in Scheme 3, in such a manner that in one 
lO end they had a DNA sequence TTC 

AAG m 

After the ligation a digestion with EcoRI took 
place, and the fragments containing 5' flanking 
sequences were ligated into EcoRI digested pBR322. 

15 These plasmids were transformed into E. coli K803 . 
and the plasmids in the transf ormants were tested 
by sequence analysis. A plasmid, p213 5 ' Lb f isolated 
from one of the transf ormants , contained a 5' flan- 
king sequence terminating 7 bp before the Lb ATG 

20 start codon in such a manner that the sequence is 
as follows : 

2Kb 

-5' flanking AAAGTAGAATTC 

Lbc3 sequence 

25 E, coli K803 is a typical representative of the E. 
coli K12 recipient strains. 



c) Sub-cloning 3' flanking region of the Lbc 3 
gene 
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For this purpose pLpHX was used which was digested 
by XhoII. The ends were partially filled out and 
excess single-stranded DNA was removed with SI 
nuclease, as shown in the attached Scheme 4. The 
5 fragment shown was ligated into pBR322 which had 
been pretreated as shown in the Scheme. The con- 
struction was transformed into E. col i K803. One 
of the transf ormants contained a plasmid called 
Xho2a-3'Lb. As the XhoII recognition sequence is 
lO positioned immediately after the Lb stop codon, cf . 
Scheme 2, the plasmid contained about 900 bp of 
the 3' flanking region, and the sequence started 
with GAATTCTACAA . 

The construction of Lb promoter cassette 

15 An EcoRI/SphI fragment from Xho2a-3'Lb was mixed 
with a BamHI/EcoRI fragment from p213-5'Lb. These 
two fragments were, ligated via the BamHI/SphI cleav- 
age sites into a pBR322 derivative where the EcoRI 
recognition sequence had been removed, cf . Scheme 

2q4. The ligated plasmids were transformed into IjL_ 
coli K803 . A plasmid in one of the transf ormants 
contained the correct fragments, and it was called 
pEJLb 5' - 3' -1. 

Construct ion of the Lbc ? 5'3'-CAT gene 

25The CAT gene of pBR322 was isolated on several 
smaller restriction fragments, as shown in the 
attached Scheme 5. The 5' coding region was isolated 
as an Alul fragment which was subsequently ligated 
into pBR322, treated as stated in the Scheme. This 
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was transformed Into E > coli K803 . Several trans- 
formants contained the correct plasmid. One was 
taken, out and called Alull. The 3 r coding region 
was isolated on a TaqI fragment. This fragment was 
5 treated with exonuclease Bal31 f whereafter EcoRI 
linkers were added. Then followed a digestion with 
EcoRI and a ligation to EcoRI digested pBR322. The 
latter was transformed into E. coli K803 and the 
transf ormants were analysed. A plasmid, Taq 12, 

lo contained the 3' coding region of the CAT gene 
plus 23 bp 3' flanking sequences subsequently term- 
inating In the following sequence CCCCGAATTC. 
Subsequently the following fragments were ligated 
together to EcoRI digested 

15 pEJLbS' -3 ' -1 : EcoRI/PvuII fragment from Alul , 
PvuII/Ddel fragment from pBR322 and Ddel/EcoRI 
fragment from Taq 12. This ligation mixture was 
transformed into E . coli K803 . Several trans formants 
contained the correct plasmid. One was taken out 

2o and was called pEJLb 5' -3' CAT 15. 

Example 3 
a . 

Cloning and integration of the soybean Lbc 3 ~5'-3 r - 
CAT gene . 

25 Two EcoRI fragments (No. 36 and No. 40) of the T L - 
DNA region of A, rhizo genes 15 834 pRi plasmid was 
used as "integration sites". Thus the Lbc3-5 r -3- 
CAT gene was subcloned (as 3,6 Kb BamHI/Sall frag- 
ment) Into two vectors pARl and pAR22 carrying the 

3Qabove EcoRI fragments. The resulting plasmids pAR29 
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and pAR30 were separately mobilized into A. rhi- 
zoeenes 15834 rif R using a plasmid helper system; 
see E. van Haute et al . (1983), EMBO J. 3, 411- 
417. Neither pAR29 nor pAR30 can replicate in Agro- 
5 bacterium. Therefore the selection by means of 
rifampicin 100 fig/ml and the plasmid markers spec- 
tinomycine 100 jug/ml , s tr eptomyc ine 100 /ig/ml or 
kanamycine 300 pg/ml will select A. rhizopenes 
bacteria having integrated the plasmids via homo- 

10 logous recombination through the EcoRI fragments 
36 or 40. The structure of the resulting T L -DNA 
regions - transferred to the transformed plant 
lines L5-9 and L6-23 - has been indicated at the 
bottom of the attached Scheme 6. In this Scheme is 

15 furthermore for the L6-23 line shown the EcoRI and 
Hindlll fragments carrying the Lbc3 - 5 ' - 3 ' - CAT gene 
and therefore hybridizing to r adioac t ively labelled 
Lbc3-5 ' -3 ' -CAT DNA used as a probe, cf. Example 
4a. 

20b^ 

Cloning and integrat ion of the soybean Lbc 3 gene. 

The EcoRI fragment No. 40 has here been used as 
"integration site". The Lbc3 gene was therefore 
sub-cloned (as a 3,6 Kb BamHI fragment into the 

25pARl vector and transferred into the T^- DNA region 
as stated in a. The structure of the T^- DNA region, 
transferred to the transformed plant line L8-35, 
has been shown at the bottom of the attached Scheme 
7. This Scheme furthermore shows the EcoRI and 

30HindIII fragments carrying the Lbc3 gene and there- 



0249676 



42 

fore hybridizing with radioactively labelled Lbc 3 
DNA used as a probe, cf , Example 4b. 
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Example 4 . 



Demonstration of the soybean Lbc 3 - 5 ' - 3 ' - CAT gene in 
transformed plants of bird's-foo t trefoil. 
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DNA extracted from transformed lines (L6-23) or 
untrans formed control plants and cleaved by the 
restriction enzymes EcoRI and Hindlll was analyzed 
by Southern-hybridization. Radioac tively labelled 
5 Lbc3«5 9 -3 ' -CAT gene was used as a probe for demon- 
strating corresponding sequences in the transformed 
lines. The bands marked with numbers correspond to 
restriction fragments constituting parts of the 
Lb c 3 - 5 ' - 3 1 - CAT gene as stated in the restriction 
lO map (Scheme 6) of Example 3a. 



Demonstration of the soybean Lbc 3 gene of trans- 
formed plants of bird's-foot trefoil. 
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DNA extracted from transformed lines (L8-35) or 
untrans formed control plants and cleaved by the 
restriction enzymes EcoRI and Hindlll was analyzed 
by Southern-hybridization. Radioactive Lbc3 gene 
was used as a probe for detecting corresponding 
sequences in the transformed lines. The bands marked 
with numbers correspond to restriction fragments 
constituting parts of the LbC3 gene as stated in 
the restriction map (Scheme 7) f Example 3b. 
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Example 5 



Expression of the Lbc 3 - 5 ' - 3 ' - CAT gene in various 
tissues of bird's-foot trefoil. 



1 2 3 4 5 6 

Un transformed L6-23 

R N LS R N LS 



7 8 -9 .10 11 12 

L5 -9 _.' Lbc,, transformed 

R N LS " R J N LS 



<- 3Ac Cm 
<-1AcCm 

«-Cm 
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The activity of the chlor oamphenic ol acetyl trans- 
ferase (CAT) enzyme is. measured as the amount of 
acetylated chloroamphenicol (AcCm) produced from 
14 C-chloroamphenicol . In (a) the acetylated forms 
5 lAcCm and 3AcCm appear, which have been separated 
from Cm through thin-layer chromatography in chloro- 
form/me thanol (95:5). The columns 1-3 show that no 
CAT activity occurs in root (R) , nodule (N) , as 
well as leaves + stem (LS) of untrans formed plants 

lO of bird's-foot trefoil. The columns 4-6 and 7-9 
show the CAT activity in corresponding tissues of 
Lbc 3 - 5 ' - 3 ' - CAT transformed L6-23 and L5-9 plants. 
The conversion of chloroamphenicol in columns 5 
and 8 shows the organ-specific, expression of the 

15 Lbc3-5' -3' -CAT gene in root nodules. The columns 
10-12 show the lack of CAT activity in plants trans- 
formed with the Lbc3 gene. 



Table 

20 L6-23 L5-9 

CAT activity CAT activity 

Root 0 0 

Nodule 6883Q cpn/yg protein-h 154,000 cpm/vg protein-h 

Leaves + 

25 Stem 0 0 



In the Table (b) the CAT activity in Lbc 3 - 5 ' - 3 ' - CAT 
transformed L5 - 9 and L6-23 plants has been stated 
as the amount of 14 C - chloroamphenico 1 converted 
into acetylated derivatives. The amount of radio- 
30 activity in the acetylated derivatives has been 



8 
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counted by liquid scintillation and stated in cpm/pg 
protein • hour. 

Example 6 

Transcription test (Northern analysis) on tissues 
5 of Lbc 3 -5' -3' -CAT transformed and Lbc 3 transformed 
Lotus plant lines. 
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5 pg of total RNA extracted from root (R) , nodule 
(N) or leaves 4- stem (LS) and separated in formal- 
dehyde agarose gels were transferred onto nitro- 
cellulose. Column 1 contains 5 /xg of total RNA from 
5 20 -day- old soybean nodules as control plants. The 
columns 2*-4 and. 5-7 contain total RNA from root, 
nodule or leaves + stem, respectively, of the LDC3- 
5' -3' -CAT transformed lines L5-9 and L6-23. The 
columns 8-10 contain RNA from corresponding tissues 

10 of bird's-foot trefoil transformed by means of A . 
rhizo genes carrying the . Lbc3 gene in the Tl-DNA. 
In (a) radioactive DNA of the CAT coding sequence 
has been used as a probe for hybridization. The 
organ- specif ic transcription of the Lbc3-5'-3'- 

15 CAT gene in root nodules from the L5 - 9 and L6-23 
lines appears from columns 3 and 6. In (b) the 
transcript for the r cons ti tutive ubiquitine gene(s) 
is visualized using a cDNA probe for the human 
ubiquitine gene for the hybridization. In (c) the 

20 nodule - specific transcription of bird's-foot trefoil 
own leghemoglobin genes is shown. A cDNA probe of 
the Lba gene of soybean has been used for this 
hybridization . 
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Example 7 

Determination of the trar^scriptian initiation site 
(CAP site^ of the Lbc 3 promoter of soybean in trans- 
formed root nodules of bird's-goot trefoil, 
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The position of the M CAP site" was determined on 
the nucleotide level by means of primer extension. 
A synthetic oligonucleotide 5 ' CAACGGTGGTATATCCAGTG3 9 
complementary to the nucleotides 15-34 in the coding 
5 sequence of the CAT gene was used as primer for 
the enzyme reverse transcriptase. As a result sin- 
gle-stranded cDNA was formed the length of which 
corresponds to the distance between the 5' end of 
the primer and the 5 9 end of the primed mRNA. A 83 

10 nucleotide cDNA strand would be expected according 
to the knowledge of the transcription initiation 
site of soybean Lbc3 gene. Columns 2, 3, and 4 
from left to right show the produced DNA strands 
when the primer extension has been operated on 

15 polyA + -pur if ied mRNA from transformed root nodules 
of bird's-foot trefoil, transformed leaves + stem 
of bird's-foot trefoil, and untransf ormed root 
nodules of bird's-foot trefoil, respectively. The 
85, 86, 87, 88, and 90 nucleotides long cDNA strand 

2o shown in column 2 proved correctly Lbc 3 promoter 
function in bird's-foot trefoil. The CAP sites 
corresponding to the cDNA sequences generated are 
indicated with asterisks (*) on the partial se- 
quence of the Lbc 3 5 '3' -CAT region given. In the 

25 sequence the TATA box of the Lbc 3 promoter and the 
corresponding translation initiation codon of the 
CAT coding sequence are underlined. 
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Example 8 

Demonstration of the correct developmental control 
of the Lb c 3 - 5 ' - 3 ' - CAT gene in transformed plants of 
bird's-foot trefoil (L6-23). 




5 CAT activity 

ft 

in cpm/pg protein- hour 0 0 32.6 342.3 1255 
Nitrogenase activity 

nmol ethylene//xg protein 0 O 0 0.5 2.7 

• hour 

lO * Substrate limited reaction; actual activity about 
68000 cpm//*g protein • hour. 

Chloramphenicol acetyl transferase and nitrogenase 
activity were measured on cut off pieces of root 
with nodules at the different developmental stages 

15 indicated. The CAT activity can be detected in the 
white distinct nodules whereas the nitrogenase 
activity did not appear until the small pink nodules 
have developed. The latter development corresponds 
to the development known from soybean control plants 

20 and described by Marcker et al. EMBO J. 1984, 3, 
1691-95. The CAT activity was determined as in 
Example 5. The nitrogenase activity was measured 
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as acetylene reduction capacity of the nodules 
followed by gas chromatographic determination of 
ethylene. 

Example 9 

5 Demonstration of LDC 3 protein in b ird' s-foot trefoil 
plants trans f ormed with the soybean Lbc i gene , 




Proteins extracted from root nodules of Lbc3 trans- 
formed (L8-35), Lb c 3 - 5 ' - 3 ' - CAT transformed and 
nontransf ormed plants were separated by isolectric 

10 focussing at a pH gradient of 4 to 5 . The columns 
1, 3, 5, 7, and 9 show Lbc^, Lbc£ , Lbc3 , and Lba 
proteins synthesized in soybean control root nod- 
ules. Column 2 shows proteins from root nodules of 
Lbc 3 -5' -3' - CAT trans f ormed L6 - 23 -bird' s - foot trefoil 

13 plants, whereas the columns 6 and 8 show proteins 
from nontransf ormed plants. The columns 4 and 10 
show soybean Lbc3 protein synthesized in root nod- 
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ules of bird's-foot trefoil plants (L8-35) trans- 
formed with the Lbc3 gene. The 1-003 protein band 
is indicated by an arrow. 

Example 10 

5 Expression of the Lb c 3 - 5 ' - 3 ' - CAT gene requires the 
5' Lbc 3 promoter region, 

The Lbc3 - 5 ' - 3 ' -CAT gene construction carries a 2 Kb * 
5' Lbc3 promoter region. Stepwise removal of se- 
quences from the 5' end of this region demonstrated 
1q that this promoter region is required for the char- 
acteristic expression of the Lbc3 - 5 ' 3 ' - CAT gene. 



Sail 
h 

The Lbc3 - 5 9 - 3 9 - CAT gene construction was opened in 
23 the unique Xbal site shown above, and digested with 
the exonuclease Bal31. A Sail linker fragment was 
ligated onto the blunt ends generated and the short- 
ened Sail fragments carrying the Lbc3 - 5 9 - 3 9 - CAT gene 
were transferred into L . corniculatus . The effect 
2o °f removing promoter sequences was measured as CAT 
activity. End points of the deleted 5' region are 
given as the distance from the CAP site in nucleo- 
tides . 



5'Lbc 3 3'Lbc 3 
Xbal EZZZj -J SaH 

2 Kb. • 
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CAT activity 

5'Lbc~ 3'Lbc 0 Cpm/>/g protein/hrs. 



2000 Root Nodule Leaf 

^ CAT 1 ' 0 80000 0 

-950 1 : ■ 1 ' 0 10000 0 

-474 • i 1 — ' 1 0 3000 0 

-230 1—1 I i o 3000 0 

-7« i o o o 



5 The drastically reduced level of CAT activity ex- 
pressed from the L.DC3 promoter deleted to nucleotide 
-230 and the zero activity from the promoter deleted 
to nucleotide -78 demonstrates that the LDC3 pro- 
moter region is required for the root nodule spe- 
IO cif ic expression of the Lb c 3 - 5 ' - 3 ' - CAT gene. 

Example 11 

Construction of the N2 3 - CAT gene. 

The N23 gene was isolated from a soybean DNA library 
as described in the enclosed paper of Sandal, Bojsen 

15 and Marcker. The N23-CAT gene was constructed from 
the modified Lbc3 - 5 ' - 3 ' - CAT gene carried on plasmid 
pEJ5 ' - 3 ' -CAT101 as described in the Applicant's 
copending application No. 86 11 4704.9 concerning 
"Expression of Genes in Yeast", and a 1 Kb . EcoRI , 

2oDdeI fragment containing the N23 5' promoter region. 
The position of the EcoRI and Ddel sites in the 
N23 promoter region is indicated on the DNA sequence 
shown below. The cloning procedure used is outlined 
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below. The disclosure of the papers of Sandal et 
al. , the EP application, and the paper of Jensen 
et al., Nature 321 (12 June 1986), 669-674, includ- 
ing the references cited should be considered in- 
5 corporated into the present description as a means 
to amend, illustrate, and clarify it. 

The N23-CAT gene was transferred to plants by the 
same method as the Lbc3 - 5 9 - 3 ' - CAT gene. 
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BamHI 




O Qi 
0 03 -O 



H * 

N23 5"promoter 



BamHI/Bglll digested 
Klenow endfilled 



Klenow endfilled 




Sail Sail 
aTT aigescea 7 



Not to scale 
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DNA sequence of the 5' -promo tor region from the 
N23 gene 



lO 20 30 40 50 60 70 

!(^^TCGATCCTgTA(^ GTCGAC CTGCAGCCCAAGCTTGGATCRATCAATTAA 

5 TTCTATTGAGACACGATTTGAACAATTTT^ ATTTGATCCAAAA 

AAATTTAAAGCTTTAGATGM?GAT 

220 230 240 250 260 270 280 

ATGAATGCTATGATATTGATGGTCTTGATNTATTNNCAGAATTGAAAGTATTAAGAGAAGTGTTA^ 

290 300 310 320 330 340 350 

JjQ agaagttagcacaccaatagaagtattgagttatattaaaactttagato 

360 370 380 390 400 410 420 

CATATAGAATTTTATTGACAATCCTTATAA 

430 440 450 460 470 480 490 

ACTT AAATCATATCTAAAATCAACAATGTT ACAAGAT AGATT GAAT GAGTTAGTTATTTTATCT ATTGAA 

5O0 SIO 520 530 540 550 560 

15 AGTAAAGTGTTAGAATTGTTTGA3^ATAAAACTCTGATAAATGATTTTGCAGTTAAA 

570 580 590 60O 610 620 63Q 

TAATATAAAAATTGATATTTTATAIAATATATT^ 

640 650 660 670 680 690 70O 

AAATAATAAAATAAAGCAACTCOTAATTTTAATGAAACATCCCTTTGTTA 

7lO 720 730 740 750 760 __77Q 

20 AAAAATTAATGCTTGATGGAAGTTTTT AATTTGTTCTACTCAATACT AAAT ATTTTTTT 

780 790 800 810 820 830 840 

TATCATTTATATGTTteTAAATATGAATGCACTAGTAAT^ 

850 860 870 880 890 900 91Q 

rATATTATAAAGATGAAAGGTCGTTACAATTTTTTTT 

920 930 940 950 960 970 9BO 

25 AGAATAAATATTTATATACAATTCCTAGATTTTGT 

990 lOOO 1010 1020 1030 1040 1050 

GAGCACACACCAAACTAGTCTCAAATTAAGTAAGGTGCTAATTATO^ 

DdeX 

ATTAATG 
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Example 12 

Organ>speGific expression of the soybean N2 3-CAT 

gene in root nodules of L.corniculatus and Trif olium 
repens . 



5The activity of chloramphenicol acetyl transferase 
(CAT) was measured as in example 5 and is given in 
cpm//ig protein/hrs . 



Table a. CAT activity 

N2 3 - CAT transformed Untrans formed 

10 L. corniculatus L » corniculatus 

Root nodule 86150 0 

Root 0 0 

Table b . CAT activity 

N2 3 - CAT transformed Untrans formed 

15 T „ repens T . repens 

Root nodule 148000 0 

Root 0 0 



Table (a) and b) shows the organ-specific expression 
of the N23-CAT. gene in root nodules of L . cornicu- 
20 latus and T . repens . L. corniculatus was inoculated 
with Rhizob ium loti . while T . repens was inoculated 
with Rhizobium trif olii . 

In connection with the invention it has thus been 
proved that root nodule - specif ic genes can be ex- 
25 pressed organ- specif ically upon transfer to other 
plants, here Lotus corniculatus and Tr If olium re - 
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pens . It has furthermore been proved that the 5 ' 
flanking regions comprising the promoter are con- 
trolled by the organ-specific regulatory mechanism 
as the organ- specif ic control of the I/DC3 - 5 9 - 3 ' - GAT 
5 gene in Lotus eorniculatus took place at the trans- 
cription level. The Lbc 3 - 5 ' - 3 ' - CAT gene transferred 
was thus only transcribed in root nodules of trans- 
formed plants and not in other organs such as roots, 
stems, and leaves. 

IO The expression of the Lbc 3 - 5 ' - 3 ' - CAT gene in root 
nodules of transformed plants also followed the 
developmental timing known from soybean root nod- 
ules. No CAT activity could be detected in roots 
or small white root nodules (Example 8) - A low 

^5 activity was present in the further developed white 
distinct nodules, whereas a high activity could be 
measured in the small pink nodules and mature nod- 
ules developed later on. 

The organ-specific expression and the correct de- 
20 velopmental expression of transferred root nodule- 
specific genes, here exemplified by the Lbc3-5'-3'- 
CAT gene, allows as a particular use a functional 
expression of root nodule- specif ic genes also in 
other plants beyond leguminous plants. When all 
25 the root nodule- specif ic plant genes necessary for 
the formation of root nodules are transferred from 
a leguminous plant to a non-root -nodule - forming 
plant species, the correct organ- specif ic expres- 
sion proved above allows production of functionally 
30active, nitrogen- fixing root nodules on this plant 
upon infection by Rhizobium . In this manner these 
plants can gr w without the supply of external 



• 
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inorganic or organic nitrogen compounds. Root nod- 
ule-specific promoters, here exemplified by the 
Lbc3 and N23 promoters, must be used in the present 
case for regulating the expression of the trans- 
5 f erred genes. 

According to the present invention a root nodule- 
specific promoter is used for expressing genes. 
The gene product or function of the gene product 
improves the function of the root nodule, e.g. by 
lO altering the oxygen transport, the metabolism, the 
nitrogen fixation or the nitrogen absorption. 



Root nodules are thus used for the synthesis of 
biological products improving the plant per se or 
which can be extracted from the plant later on. A 
15 root nodule - specif ic promoter can be used for ex- 
pressing a gene. The gene product or compound formed 
by said gene product constitute the desired pro- 
duct ( s ) . 



In connection with the present invention it has 
20 furthermore been proved that the soybean Lbc3 leg- 
hemoglobin protein per se, i.e. the Lbc3 gene pro- 
duct, is present in a high concentration in root 
nodules of bird's-foot trefoil plants expressing 
the Lbc3 code sequence under the control of the 
25 Lbc3 promoter. The latter has been proved by cloning 
the genomic Lbc3 gene of the soybean into the in- 
tegration vector pARl , said genomic Lbc3 gene con- 
taining the coding sequence, the intervening se- 
quences, and the 5' and 3' flanking sequences. A 
30 3 . 6 Kb BamHI fragment Lbc 3HH , cf. Example 2, was 
cloned into the pARl plasmid and transferred to 
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bird's-foot trefoil as stated previously. 

The high level of L.DC3 protein, cf. Example 9, 
found in transformed root nodules of bird's -foot 
trefoil and corresponding to the level in soybean 
5 root nodules proves an efficient transcription of 
the Lbc3 promoter and an efficient processing and 
translation of Lbc3mRNA in bird's-foot trefoil. 

The high level of the CAT activity present in trans- 
formed root nodules is also a result of an efficient 

IO translation of mRNA formed from the chimeric Lbc3 
gene. The leader sequence on the Lbc3 gene is de- 
cisive for the translation initiation and must 
determine the final translation efficiency. This 
efficiency is of importance for an efficient syn- 

15 thesis of gene products in plants or plant cells. 
An Lbc3 or another leghemoglob in leader sequence 
can thus be used for increasing the final expression 
level of a predetermined plant promoter. The con- 
struction of a DNA fragment comprising a Lb leader 

20 se< I uence as first sequence and an arbitrary promoter 
as second sequence is a particular use of the in- 
vention when the construction is transferred and 
expressed in plants. 

During nodule development around 30 different plant 
25 encoded polypeptides (nodulins) are specifically 
synthesized. Apart from the leghemoglob ins , nod- 
ulins include nodule - specif ic forms of uricase 
(Bergmann et al (1983) EMBO . J. 2, 2333-2339), 
glutamine synthetase (Cullimore et al (1984) J.Mol. 
3QAppl. Genetics 2, 589-599) and sucrose synthase 
(Morell and Copeland (1985) Plant. Physiol. 78, 
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149-154). The function of most nodulins are, how- 
ever, at present unknown. 

Many nodulin genes have nevertheless been isolated 
and characterised during the last five years. These 
5 include nodulins from several different legumes. 
Examples of such isolations and characterisations 
are widespread in the literature such as (Fuller et 
al (1983) Proc. Natl. Acad.Sci. 80, 2594-2598), 
(Sengupta-Gopalan et al (1986) Molec. Gen. Genet. 

10 203, 410-420), (Bisseling et al (1985) in Proceed- 
ings of the 6th Int. symp . on Nitrogen Fixation, 
Martinus Nijhoff Publishers pp 53-59.), and (Geb- 
hardt et al (1986) EMB0.J.5, 1429-1435). All of 
these genes contain nodule - spec if ic regulatory 

15 sequences. Such sequences and in fact entire 5 ' 
flanking regions and 3' flanking regions can fur- 
thermore be synthesized by automated oligonucleotide 
synthesis knowing the DNA sequences for the Lbc3 
and N23 genes given in this description. Entire 

20 nodule - spec if ic genes can also be isolated with 
known recombinant techniques as described in the 
above papers and by (Maniatis et al (1982) Mole- 
cular cloning. A Laboratory Manual, Cold Spring 
Harbour ' Laboratory , New York). 

25 The described method to obtain nodule - spec if ic 
expression of genes can thus be reconstructed and 
performed according to the invention by any one 
skilled in the art of molecular genetics. 

The method to obtain nodule - specif ic expression is 
30 not dependent on the A. rhizogenes plant transforma- 
tion described. Any other plant transformation 
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system e.g. A, tumefaciens systems, direct gene 

transfer or microinjection can equally be applied. 

The — rhlzogenes system has been used and charac- 
terised by a number of scientific groups and is 
5 thus well-known from the literature. The character- 
istics of the system is described in: 

Willmitzer et al . (1982), Molec.Gen. 
Genet. 186, 16-22, 

Chilton et al. (1982), Nature 295, 432-434, 

lO Simpson et al. (1986), Plant . Molec . Biol . 

6, 493-415, 

Tepfer D. (1983), Molecular Genetics of 
the Bacteria - Plant interaction, 

Springer Verlag, Berlin Heidelberg pp 
15 248-258, 

White and Nester (1980), J.Bact. 144, 
710-720, 

Jaynes and Strobel (1981), Int. Rev. of Cytol. 
Sup. 13, 105-125, 

20 White and Nester (1980), J. Bact. 141, 

1134-1141, 

Pomponi et al . (1983), Plasmid 10, 119- 
12 9, and 



0249676 



65 

Slightom et al. (1986), J. Biol. Chem. 
261, 108-121. 

The latter two publications describe the restriction 
map and nucleotide sequence of the A. rhizogenes 
5TL-DNA segment used in the transformation system de- 
scribed here. With this information it is possible 
to anybody skilled in molecular genetics to use 
and reconstruct the "intermediate vectors" and the 
A . rhizogenes strains described here. 
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Claims : 

1. A method of expressing genes in plants, parts 
of plants, and plant cell cultures by introducing 
into a cell, thereof a recombinant DNA segment con- 

5 taining both the gene to be expressed and a 5' 
flanking region comprising a promoter sequence, 
and optionally a 3' flanking region, and culturing 
of the transformed cells in a growth medium, 
characterised by using as °the recom- 
lO binant DNA segment a DNA fragment comprising an 
inducible plant promoter (as defined) from root 
nodule- specific genes . 

2. A method as claimed in claim 1, char- 
acterised by using a DNA fragment com- 

15 prising an inducible plant promoter (as defined) 
and being identical with, derived from or comprising 
5' flanking regions of root nodule-specific genes. 

3. A method as claimed in claim 2, char- 
acterised by using a DNA fragment com- 

2o prising an inducible plant promoter (as defined) 
and being identical with, derived from or comprising 
5' flanking regions of root nodule - specif ic genes, 
said DNA fragment causing an expression of a gene 
which is induced in root nodules at specific stages 

25 of development and as a step of the symbiosis, 
whereby nitrogen fixation occurs. 

4. A method as claimed in claims 1-3 for the 
expression of root nodule- specif ic genes, 
charac terised by using a DNA fragment 

30 comprising an inducible plant prom ter (as defined) 
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from root nodule - specif ic genes. 

5. A method as claimed in claims 1-3 for the 
expression of genes in leguminous plants, parts of 
leguminous plants, and leguminous plant cell cul- 

5 tures, characterised by using a DNA 
fragment comprising an inducible plant promoter 
(as defined) from root nodule - spec if ic genes. 

6. A method as claimed in claims 1-5, char- 
acterised by the DNA fragment comprising 

lO the inducible plant promoter and being identical 
with, derived from.or comprising 5' flanking regions 
of leghemoglobin genes. 

7 . A method as claimed in claim 6 , char- 
acterised by the DNA fragment comprising 

15 the inducible plant promoter and being identical 
with, derived from or comprising 5' flanking regions 
of soybean leghemoglobin genes. 

8. A method as claimed in claim 7, char- 
acterised by the DNA fragment comprising 

20 the inducible plant promoter and being identical 
with, derived from or comprising 5' flanking regions 
of the Lba gene with the sequence 

GAGATACATT ATAATAATCT CTCTAGTGTC TATTTATTAT TTTATCTGGT 
G AT AT AT AC C TTCTCGTATA CTGTTATTTT TTCAATCTTG TAGATTTACT 

25 TCTTTTATTT TTATAAAAAA GACTTTATTT TTTTAAAAAA AATAAAGTGA 
ATTTTGAAAA CATGCTCTTT GACAATTTTC TGTTTCCTTT TTCATCATTG 
GGTTAAATCT CATAGTGCCT CTATTCAATA ATTTGGGCTC AATTTAATTA 
GTAGAGTCTA CATAAAATTT ACCTTAATAG TAGAGAATAG AGAGTCTTGG 
AAAGTTGGTT TTTCTCGAGG AAGAAAGGAA ATGTTAAAAA CTGTGATATT 
TTTTTTTTGG ATTAATAGTT ATGTTTATAT GAAAACTGAA AATAAATAAA 
CTAACCATAT TAAATTTAGA ACAACACTTC AATTATTTTT TTAATTTG AT 
TAATTAAAAA ATTATTTGAT TAAATTTTTT AAAAGATCGT TGTTTCTTCT 
TCATCATGCT GATTGACACC CTCCACAAGC " CAAGAGAAAC ACATAAGCTT 

30 TGGTTTTCTC ACTCTCCAAG CCCTCTATAT AAACAAATAT TGGAG.TGAAG 
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TTGTTG CAT A ACTTGCATCG AACAATTAAT AGAAATAACA GAAAA7~AAA 
A A AG A A AT AT G. - ^ 

9. A method as claimed in claim 7, char- 

acterised by the DNA fragment comprising 
5 the inducible plant promoter and being identical 
with, derived from or comprising 5' flanking regions 
of the Lbc^ gene with the sequence: 

TTCTCTTAAT ACAATGGAGT TTTTGTTGAA CATACATACA TTTAAAAAAA 
AATCTCTAGT GTCTATTTAC CCGGTGAGAA GCCTTCTCGT GTTTTACACA 
ID CTTTAATATT ATTATATCCT CAACCCCACA AAAAAGAATA CTGTTATATC 
TTTCCAAACC TGTAGATTTA TTTATTTATT TATTTATTTT TACAAAGGAG 
ACTTCAGAAA AGTAATTACA TAAAGATAGT GAACATCATT TTATTTATTA 



* UAa 



TATTGGGTGA AATCT CAT AG TGAAGCCATT AAATAATTTG GGCTCAAGT 
TTATTAGTAA AGTCTGCATG AAATTTAACT TAACAATAGA GAGAGTTTTC 
1R G A A AG G GAG C GAATG TT AAA AAGTGTGATA TTATATTTTA TTTCGATTAA 
10 TAATTATGTT TACATGAAAA CATACAAAAA AATACTTTTA AATTCAGAAT 
AATACTTAAA ATATTTATTT GCTTAATTGA TTAACTGAAA ATTATTTGAT 
TAGGATTTTG AAAAGATCAT TGGCTCTTCG TCATGCCGAT TGACACCCTC 
CACAAGCCAA GAGAAACTTA AGTTGTAAAC TTTCTCACTC CAAGCCTTCT 
AT AT A A AC AT GTATTGGATG TGAAGTTATT GCATAACTTG CATTGAACAA 
TAGAAAATAA CAAAAAAAAG TAAAAAAGTA GAAAAGAAAT ATG, 



20 10 • A method as claimed in claim 7, char- 
acterised by the DNA fragment comprising 
the inducible plant promoter and being identical 
with, derived from or comprising 5' flanking regions 
of the Lbc£ gene with the sequence: 

25 TCGAGTTTTT . ACTGAACATA CATTTATTAA AAAAAACTCT CTAGTGTCCA 
TTTATTCGGC GAGAAGCCTT CTCGTGCTTT ACACACTTTA ATATTATTAT 
ATCCCCACCC CCACCAAAAA AAAAAAAACT GTTATATCTT TCCAGTACAT 
TTATTTCTTA TTTTTACAAA GGAAACTTCA CGAAAGTAAT TACAAAAAAG 
ATAGTGAACA TCATTTTTTT AGTTAAGATG AATTTTAAAA TCACACTTTT 
TTATATTTTT TTGTTACCCT TTTCATTATT GGGTGAAATC TCATAGTGAA 
ACTATTAAAT AGTTTGGGCT CAAGTTTTAT TAGTAAAGTC TGCATGAAAT 
TTAACTTAAT AATAGAGAGA GTTTTGGAAA GGTAACGAAT GTTAGAAAGT 

30 GTGATATTAT TATAGTTTTA TTTAGATTAA TAATTATGTT TACATGAAAA 
TTGACAATTT ATTTTTAAAA TTCAGAGTAA TACTTAAATT ACTTATTTAC 
TTTAAGATTT TGAAAAGATC ATTTGGCTCT TCATCATGCC GATTGACACC 
CTCCACAAGC CAAGAGAAAC TTAAGTTGTA ATTTTTCTAA CTCCAAGCCT 
TC TATATAAA CACGTATTGG ATGTGAAGTT GTTGCATAAC TTGCATTGAA 
CAATAGAAAT AACAACAAAG AAAATAAGTG AAAAAAGAAA TATG , 
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11. A method as claimed in claim 7, char- 
acterised by the DNA fragment comprising 
the inducible plant promoter and being identical 
with, derived from or comprising 5' flanking regions 
5 of the Lbc3 gene with the sequence: 

TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 
GTACTATTTA AGAAAAGAAA AAA AAA AC CT G CT A CAT A AT TTCCAATCTT 
GTAGATTTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATTACAAA 
ATAAAAATAG TGAACATCGT CTAAGCATTT TTATATAAGA TGAATTTTAA 

10 AAATATAATT TTTTTGT CT A AATCGTATGT ATCTTGTCTT AGAG CC ATTT 
TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGTTTGA 
TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCGTAGTGA 
CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAG TAT AGTTTGCATA 
AATTTTAACT TAAAAATAGA G AAA AT C TG G AAAAGGGACT GTTAAAAAGT 
GTGATATTAG AAATTTGTCG GATATATTAA TAT TTT ATTT TATATGGAAA 
CTAAAAAAAT ATATATTAAA ATTTTAAATT CAGAATAATA CTTAAATTAT 
TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGATTGTCTC 

15 TTCACCATAC CAATTGATCA CCCTCCTCCA ACAAGCCAAG AGAGACATAA 
GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTCTATA TAAATAA GTA 
TTGGATGTGA AGTTGTTGCA TAACTTGCAT TGAACAATTA ATAGAAATAA 
CAGAAAAGTA GAAAAGAAAT ATG. 



12. A method as claimed in claim 7, c h a r a c- 
2o t e r i s e d by the DNA fragment comprising the 
inducible plant promoter and being identical with, 
derived from or comprising 5' flanking regions of 
the Lbc3 - 5 ' - 3 ' - CAT gene with the sequence: 

TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 
25 GTACTATTTA AGAAAAGAAA . AAAA A A ACCT GCTACATAAT TTCCAATCTT 
GTAGAT.TTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATTACAAA 
ATAAAAATAG TGAACATCGT CTAAGCATTT TTATATAAGA TGAATTTTAA 
AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AGAG CC ATTT 
TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGTTTGA 
TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCGTAGTGA 
30 CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT AGTTTGCATA 
AATTTTAACT TAAAAATAGA GAAAATCTGG AAAAGGGACT GTTAAAAAGT 
GTGATATTAG AAATTTGTCG GATATATTAA TAT TTT ATTT TATATGGAAA 
CTAAAAAAAT ATATATTAAA ATTTTAAATT CAGAATAATA CTTAAATTAT 
TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGATTGTCTC 
TTCACCATAC CAATTGATCA CCCTCCTCCA ACAAGCCAAG AGAGACATAA 
GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTC TATA TAAATAAGTA 
TTGGATGTGA AGTTGTTGCA TAACTTGCAT TGAACAATTA ATAGAAATAA 
CAGAAAAGTA GAATTCT AAA ATG 
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13. A method as claimed In claim 5, charac- 
terised by the DNA fragment comprising the 
inducible plant promoter and being identical with, 
derived from or comprising 5' flanking regions of 
5 the N23 gene with the sequence: 



lO 20 30 40 50 60 70 

GAATTCCatfCTCGCCCGGGGATCGATCCTCT 
salZ 



TTCTATTGAGACACGATTTGAACAA^ 

lO AAATTTAAAGCTTTAGATG^ 

220 230 240 250 260 270 280 

ATGAATGCTASGATATTGATCXrcCTTGATO 

290 300 3lO 320 330 340 350 

AGAAGTTAGCACACCAATAGAAGTATTGAGTTAIAT^ 

360 370 380 390 400 4XO 420 

15 CATATAGAATTTTATTGACMTCCTIATAACAGT^^ 

430 440 450 460 470 480 490 

ACTT AAATCATAT CTAAAATCAACAAT GTTACAAGATAGATTGAATGAGTTAGTTATTTTATCTATTGAA 

BOO 510 520 530 540 550 560 

AGT AAAGTGTTAGAATTGTTTGATTATAAAACTCTGATAAATGATTTT GCAGTTAAAAAAACT AGAAGAT 



20 



570 580 590 600 610 620 630 

TAATATAAAAATTGATATTTTATATAATATATTAA 

640 650 660 670 6BO 690 7O0 

AAATAAXAAAATAAAGCAACTCTTAATTTTAATGAA^ 

710 720 730 740 7SO 760 770 

AAAAATTAATGCTTGATGGAAGTTTTTAATTTCTTCTACTCAATACT 

780 790 800 810 820 830 840 

25 TAT CATTT ATATGTTGTAAATATGAAT GCACTAGTAATT AGTTTAATGAT AAAAT AT ATTCT ACAGATAT 

850 860 870 880 890 90O __91Q 

rATATTATAAAGATGAAAGGT CGTT ACAATTTTTTTT 

920 930 940 95D 960 970 980 

AGAATAAATATTTATATACAATTCCT AGATTTTGTTATAAAATTCACATAOT GT AT GAGT ATAAAT ACAT 

990 1O0O 1010 1020 1030 104O 1050 

30 GAGCACACACCAAACTAGTCTCAAATTAAGTAAGGTGCTAATTATTAG^^ 

ATTAATG 
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14. A method as claimed in any of the claims 
1-13, characterised by the 3' flanking 
region of the genes to be expressed being a 3' 
flanking region of root nodule - specif ic genes of 
5 any origin. 



15. A method as claimed in claim 14, char- 
acterised by the 3' flanking region being 
of leghemoglobin genes. 

16. A method as claimed in claim 14, c h a r - 
IQacterised by the 3' flanking region being 

of soybean leghemoglobin genes. 

L7 . A method as claimed in claim 16 , char- 
acterised by the 3' flanking region being 
of the Lba, Lbc^, Lbc£ or Lbc3 gene with the fol- 
15 lowing sequences, respectively: 



Lba 



1590 1620 
TAA TTA GTA TCT ATT GCA GTA AAG TGT AAT AAA TAA ATC TTG 



1650 



16BO 



20 TTT CAC TAT AAA ACT TGT TAC TAT TAG ACA AGG GCC TGA TAC AAA ATG TTG GTT AAA ATA 

17X0 1740 
ATG GAA TTA TAT AGT ATT GGA TAA AAA TCT TAA GGT TAA TAT TCT ATA TTT GCG TAG GTT 

1770 1800 
TAT GCT TGT GAA TCA TTA TCG GTA TTT TTT TTC CTT TCT GAT AAT TAA TCG GTA AAT TA 

1830 I860 
25 ACA AAT AAG TTC AAA ATG ATT TAT ATG TTT CAA AAT TAT TTT AAC AGC AGG TAA AAT GTT 



ATT TGG TAC GAA AGC TAA TTC GTC GA 
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Lbci 



1320 

TAA/TT AGG ATC TAC TGC ATT GCC GTA 



13SO 1380 
AAG TGT AAT AAA TAA ATC TTG TTT CAA CTA AAA CTT GTT ATT AAA CAA GTT CCC TAT ATA 

1410 1440 
AAT GTT GTT TAA AAT AAG TAA ATT TCA TTG TAT TGG ATA AAC ACT TTT AAG TTA TAT ATT 

1470 1500 
TCC ATA TAT TTA CGT TTG TGA ATC ATA ATC GAT ACT TTA TAA AAA TAA ATT CCA AAT AAT 

TTA TAC GTT TTA AAA ATT ATT TT 



Lbc< 



TAG/GAT CTA CTA TTG CCG TCA AGT 

X140 

GTA ATA AAT AAA TTT TGT TTC ACT AAA ACT TGT TAT TAA ACA AGT CCC CGA TAT ATA AAT 

1170 1200 

GTT GGT TAA AAT AAG TAA ATT ATA CGG TAT TGA TAA ACA ATC TTA AGT TTT ATA TAT AGT 

1230 1260 

TCC ATA TAC TAA AGT TTG TGA ATC ATA ATC GA 

1290 



and Lb c 3 



TAG/GAT CTA CAA TTG CCT TAA AGT GTA ATA AAT AAA 
990 102O 

TAT TAT TTC ACT AAA ACT TGT TAT TAA ACC AAG TTC TCG ATA TAA ATG TTG GTT AAA CTA 

lOSO 1080 

AGT AAA TTA TAT GGT ATT GGA TAA ACA ATC TTA AGC TT 

1110 



18. A method as claimed in claim 1 of preparing 
a polypeptide by introducing into a cell of a plant, 
a part of a plant or a plant cell culture a recombi- 
nant plasmid, characterised by using 
as the recombinant plasmid a plasmid comprising an 
inducible plant promoter (as defined) of root nod- 
ule-specific genes. 
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19. A DNA fragment comprising an inducible plant 
promoter (as defined) to be used when carrying out 
the method as claimed in claims 1-18, char- 
acterised by being identical with, de- 

5 rived from or comprising a 5' flanking region of 
root nodule - specif ic genes of any origin. 

20. A DNA fragment as claimed in claim 19, 
characterised by being identical with, 
derived from or comprising a 5' flanking region of 

lO plant leghemoglobin genes. 

21. A DNA fragment as claimed in claim 20, 
characterised by being identical with, 
derived from or comprising a 5' flanking region of 
soybean leghemoglobin genes. 

15 22. A DNA fragment as claimed in claim 21, 

characterised by being identical with, 
derived from or comprising a 5' flanking region of 
the Lba gene with the sequence: 



GAGATACATT ATAATAATCT CTCTAGTGTC TATTTATTAT TTTATCTGGT 

20 GATATATACC TTGTCGTATA CTGTTATTTT TTCAATCTTG TAGATTTACT 

TCTTTTATTT TTATAAAAAA GACTTTATTT TTTTAAAAAA AATAAAGTGA 

ATTTTGAAAA CATGCTCTTT GACAATTTTC TGTTTCCTTT TTCATCATTG 

GGTTAAATCT CAT AGTG CCT CTATTCAATA ATTTGGGCTC AATTTAATTA 

GTAGAGTCTA CATAAAATTT ACCTTAATAG TAGAGAATAG AGAGTCTTGG 

AAAGTTGGTT TTTCTCGAGG AAGAAAGGAA ATGTTAAAAA CTGTGATATT 

TTTTTTTTGG ATTAATAGTT ATGTTTATAT GAAAACTGAA AATAAATAAA 

25 CTAACCATAT TAAATTTAGA ACAACACTTC AATTATTTTT TTAATTTGAT 

TAATTAAAAA ATTATTTGAT TAAATTTTTT AAAAGATCGT TGTTTCTTCT 

. TCATCATGCT GATTGACACC CTCCACAAGC CAAGAGAAAC ACATAAGCTT 

TGGTTTTCTC ACTCTCCAAG CCCTC TAT AT AAA CAAATAT TGGAGTGAAG 

TTGTTGCATA ACTTGCATCG AACAATTAAT AGAAATAACA GAAAATTAAA 
AAAGAAATAT G, 
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23. A DNA fragment as claimed In claim 21, 
characterised by being identical with, 
derived from or comprising a 5' flanking region of 
the Lbc^ gene with the sequence: 

TTCTCTTAAT ACAATGGAGT TTTTGTTGAA CATACATACA TTTAAAAAAA 
5 AATCTCTAGT GTCTATTTAC CCGGTGAGAA GCCTTCTCGT GTTTTACACA 
CTTTAATATT ATTAT AT CCT CAACCCCACA AAAAAGAATA CTGTTATATC 
TTTCCAAACC TGTAGATTTA TTTATTTATT TATTTATTTT TACA A AG GAG 
ACTTCAGAAA AGTAATTACA TAAAGATAGT GAACATCATT TTATTTATTA 
TAATAAACTT TAAAATCAAA CTTTTTTATA TTTTTTGTTA CCCTTTTCAT 
TATTGGGTGA AATCTCATAG TGAAGCCATT AAATAATTTG GGCTCAAGTT 
TTATTAGTAA AGTCTGCATG AAATTTAACT TAACAATAGA GAGAGTTTTC 
lOGAAAGGGAGC GAATGTTAAA AAGTGTGATA TTATATTTTA TTTCGATTAA 
TAATTATGTT TACATGAAAA CA TACA AAA A AATACTTTTA AATTCAGAAT 
AATACTTAAA ATATTTATTT GCTTAATTGA TTAACTGAAA ATTATTTGAT 
TAGGATTTTG AAAAGATCAT TGGCTCTTCG TCATGCCGAT TGACACCCTC 
CACAAGCCAA GAGAAACTTA AGTTGTAAAC TTTCTCACTC CAAGCCTTCT 
ATATAAACAT GTATTGGATG TG A AG TT ATT GCATAACTTG CATTGAACAA 
TAG A A AAT A A CAAAAAAAAG TAAAAAAGTA GAAAAGAAAT ATG , 



15 24. A DNA fragment as claimed in claim 21, 

characterised by being identical with, 
derived from or comprising a 5' flanking region of 
the Lbc£ gene with the sequence: 

TCGAGTTTTT ACTGAACATA CATTTATTAA AAAAAACTCT CTAGTGTC 
TTTATTCGGC GAGA AG CCTT CTCGTGCTTT ACACACTTTA ATATT AT TAT 

20 ATCCCCACCC CCACCAAAAA AAAAAAAACT GTTATATCTT TCCAGTACAT 
TTATTTCTTA TTTTTACAAA GGAAACTTCA CGAAAGTAAT TACAAAAAAG 
ATAGTGAACA TCATTTTTTT AGTTAAGATG AATTTTAAAA TCACACTTTT 
TTATATTTTT TTGTTACCCT TTTCATTATT GGGTGAAATC TCATAGTGAA 
ACTATTAAAT AGTTTGGGCT CAAGTTTTAT TAGTAAAGTC TG CATGA A AT 
TTAACTTAAT AATAGAGAGA GTTTTGGAAA GGTAACGAAT GTTAGAAAGT 
GTGATATTAT TATAGTTTTA TTTAGATTAA TAATTATGTT TACATGAAAA 
TTGACAATTT ATTTTTAAAA TTCAGAGTAA TACTTAAATT ACT T ATT T AC 

25 TTTAAGATTT TGAAAAGATC ATTTGGCTCT TCATCATGCC GATTGACACC 
CTCCACAAGC CAAGAGAAAC TTAAGTTGTA ATTTTTCTAA CTCCAAGCCT 
TCTATATAAA CACGTATTGG ATGTGAAGTT GTTGCATAAC TTGCATTGAA 
CAATAGAAAT AACAACAAAG AAAATAAGTG AAAAAAGAAA TATG , 



25. A DNA fragment as claimed in claim 21, 
characterised by being identical with, 
30 derived from or comprising a 5' flanking region of 
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the Lbc3 gene with the sequence: 

TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 
GTACTATTTA AGAAAAGAAA AAAAAAACCT GCTACATAAT TTCCAATCTT 
GTAGATTTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATTACAAA 
5 ATAAAAATAG TGAACATCGT CTAAGCATTT TTATATAAGA TGAATTTTAA 
AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AGAGCCATTT 
TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGTTTGA 
TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCGTAGTGA 
CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT AGTTTGCATA 
AATTTTAACT TAAAAATAGA G A A A ATCTG G AAAAGGGACT GTTAAAAAGT 
GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT TATATGGAAA 
in CTAAAAAAAT ATATATTAAA ATTTTAAATT CAGAATAATA CTTAAATTAT 
TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGATTGTCTC 
T^CACCATAC CAATTGATCA CCCTCCTCCA ACAAGCCAAG AG AG AC AT A A 
GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTC TATA TA A AT A A CTA 
TTGGATGTGA AGTTGTTGCA TAACTTGCAT TGAACAATTA ATAGAAATAA 
CAGAAAAGTA GAAAAGAAAT ATG. 



15 26. A DNA fragment as claimed in claim 21, 

characterised by the DNA fragment 
comprising the inducible plant promoter being iden- 
tical with, derived from or comprising 5' flanking 
regions of Lbc 3 - 5 ' - 3 ' - CAT gene with the sequence: 

20 TATGAAGATT AAAAAATACA CTCATATATA TGCCATAAGA ACCAACAAAA 

GTACTATTTA AGAAAAGAAA AAAAAAACCT GCTACATAAT TTCCAATCTT 

GTAGATTTAT TTCTTTTATT TTTATAAAGG AGAGTTAAAA AAATTACAAA 

ATAAAAATAG TGAACATCGT CTAAGCATTT TTATATAAGA TGAATTTTAA 

AAATATAATT TTTTTGTCTA AATCGTATGT ATCTTGTCTT AGAGCCATTT 

TTGTTTAAAT TGGATAAGAT CACACTATAA AGTTCTTCCT CCGAGTTTGA 

TATAAAAAAA ATTGTTTCCC TTTTGATTAT TGGATAAAAT CTCGTAGTGA 

25 CATTATATTA AAAAAATTAG GGCTCAATTT TTATTAGTAT AGTTTGCATA 

AATTTTAACT TAAAAATAGA GAAAATCTGG AAAAGGGACT GTTAAAAAGT 

GTGATATTAG AAATTTGTCG GATATATTAA TATTTTATTT TATATGGAAA 

CTAAAAAAAT ATATATTAAA ATTTTAAATT CAGAATAATA CTTAAATTAT 

TTATTTACTG AAAATGAGTT GATTTAAGTT TTTGAAAAGA TGATTGTCTC 

TTCACCATAC CAATTGATCA CCCTCCTCCA ACAAGCCAAG AG AG ACAT AA 

GTTTTATTAG TTATTCTGAT CACTCTTCAA GCCTTC TATA TAAATAAG TA 

TTGGATGTGA AGTTGTTGCA TAACTTGCAT TGAACAATTA ATAGAAATAA 

3Q CAGAAAAGTA GAATTCT AAA ATG 



27. A DNA fragment as claimed in claim 19, 
characterised by being identical with, 
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derived from or comprising 5' flanking regions of 
the N23 gene with the sequence: 



lO 20 30 40 SO 60 70 

GAATTCGAGCTCGCCCGGGGATCGATCCT CTAGAST^G^CTGCAGCCCAAGCTTGGATCAATCAATTAA 
Erow r Sail 

go go lOO HO 120 130 140 

5 TTCTATTGAGACACGATTTGAACAAXTTTTACATTATGA 

JUUVTTTAAAGCTTTAGAX^ 

220 230 240 250 260 270 280 

ATGAATGCTATGATATTGATGGTCTTGATNTATT^CAGAATTGAAAGTATTAAGAGAAGTGTTAAGA^ 

290 300 3lO 320 330 340 350 

lO AGAAGTT AGCACACCAATAGAAGT ATTGAGTTAT ATTAAAACTTC G 

3 60 370 380 390 400 410 420 

CATATAGAATTTTATTGACAATCCTTATAACA^^ 

430 440 450 460 470 480 490 

ACTTAAATCATATCTAAAATCAACAATGTTACAAGATAGATTGAATGAGTTAG 

500 SIO 520 530 540 550 560 

15 AGT AAAGTGTTAGAATT GTTTGATTAT AAAACTCTGATAAATGATTTT GCAGTTAAAAAAACTAGAAGAT 

570 580 590 600 6lO 620 630 

TAATATAAAAATTGAIATTTTATATAATATATTAAGTCT 

640 6 SO 660 670 680 690 700 

AAATAATAAAATAAAGCAACTCTTAATTTTAATGAAACATCCCTTTGTTAAACCG 

710 720 73a 740 750 760 ^ 77Q 

20 AAAAATTAATGCTTGAT GGAAGTTTTTAATTTGTTCTACT CAATACTCAAAGGGTTGTAAAT ATTTTTTT 

780 790 800 810 820 830 840 

TATCATTTATATGTTGTAAATATGAATGCACTAGTAATTAGT1TAATGAT 

850 860 870 880 890 9O0 910 

ATTTCTGTCTCTTGGCAACTCGTGAGAATTGAATATATT^ 

920 930 940 950 960 970 980 

25 AGAATAAATATTTATATACAAXTCCTAGATTTTGCT^ 

990 1O0O I010 1020 1030 X040 ^JS3R 

GAGCACACACCAAACTAGTCTCAAATTAAGTAAGGTGC^^ 

DdeX 

ATTAATG 



28. A plasmid which can be used when carrying 



# 
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out the method as claimed in claims 1-18, 
characterised by comprising a DNA 
fragment as claimed in any of the claims 19-27. 

29. A plasmid as claimed in claim 28, c h a r- 
Sacterisedby being pAR29 . 

30. A plasmid as claimed in claim 28, c h a r- 
acterisedby being pAR30. 

31. A plasmid as claimed in claim 28, c h a r- 
acterisedby being pARll. 

10 32. A plasmid as claimed in claim 28, char- 
acterised by being N2 3 - CAT . 

33 . A transf ormant Agrobact erium rhizogenes 15834- 
strain which can be used when carrying out the 
method as claimed in any of the claims 1 to 18, 

15 characterised by the bacterium strain 
being transformed by a plasmid according to any of 
the preceding claims 28 to 32. 

34 . A transf ormant Agrobac ter ium rhizogenes 15 8 34- 
strain which can be used when carrying out the 

20 metnod as claimed in any of the claims 1 to 18 , 

characterised by the bacterium strain 
being transformed by pAR29 and being named AR1127. 

35 . A transf ormant Agrobac ter ium rhizogenes 15834- 
strain which can be used when carrying out the 

25 method as claimed in any of the claims 1 to 18 , 

characterised by the bacterium strain 
being transformed by pAR30 and being named AR1134. 
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36 . A transformant Agrobactertum rhizo^enes 15834- 
strain which can be used when carrying out the 
method as claimed in any of the claims 1 to 18, 
characterised by the bacterium strain 

5 being transformed by pARll and being named AR1000. 

37 . A transformant Aerobacter ium rhizogenes 15834- 
strain which can be used when carrying out the 
method as claimed in any of the claims 1 to 18, 
characterised by the bacterium strain 

lO being transformed by N23-CAT and being named AR204- 
N2 3-CAT. 

38. Plants, parts of plants and plant cells, 
particularly of the family Leguminosae, obtainable 
by transformation" with a recombinant DNA segment, 
15 fragment or plasmid according to any one of the 
claims 1 to 37. 



